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that F(Mo) = Mo and F(M \ Mo) = M \ Mo- Under some large deviations assumptions on the random 
perturbations, we show that, if there exists a positive attractor for F (i.e. an attractor for FinM\ Mo), then 
the weak* limit points of /i e are supported by the positive attractors of F. To illustrate the broad applicability 
of these results, we apply them to nonlinear branching process models of mctapopulations, competing species, 
host-parasitoid interactions, and evolutionary games. 
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1. Introduction 

A fundamental issue in biology is what are the minimal conditions to ensure the long-term survivorship for all of 
the interacting components whether they be viral particles, bio-chemicals, plants, or animals. When these condi- 
tions are met the interacting populations are said to persist or coexist. Since the pioneering work of Lotka [1925] 
and Volterra [1926] on competitive and predator-prey interactions, Thompson [1924], Nicholson and Bailey [1935] 
on host-parasite interactions, and Kermack and McKcndrick [1927] on disease outbreaks, nonlinear difference and 
differential equations have been used to understand conditions for persistence of interacting populations. For these 
deterministic models, persistence is often equated with an attractor bounded away from the extinction states in which 
case persistence holds over an infinite time horizon [Schreiber, 2006b]. In reality, extinction in finite time is inevitable 
due to finite population sizes and mortality events occurring with positive probability. However, for systems with 
large number of individuals or particles, these times to extinction may be sufficiently long that the system remains 
in a "metastablc state" bounded away from extinction for a long time. To provide a rigorous mathematical basis for 
this intuition, we develop a general theory for randomly perturbed dynamical systems with absorbing states. Under 
the appropriate assumptions about the random perturbations, we show that the existence of a positive attractor 
(i.e. an attractor which is bounded away from extinction states) for the unperturbed system implies two things as 
the number of individuals or particles gets large. First, when they exist, quasi-stationary distributions concentrate 
on the positive attractors of the unperturbed system. Second, the expected time to extinction for systems starting 
according to this quasi-stationary distribution grows exponentially with the system size. In particular, we generalize 
earlier related work for one-dimensional randomly perturbed dynamical systems [Hognas, 1997, Klebaner et al., 1998, 

1 

imsart-generic ver. 2011/11/15 file: QSD_arxiv_nov2013.tex date: November 22, 2012 



/QSD for random perturbations 



2 



Ramanan and Zeitouni, 1999] to higher dimensional systems by extending a general theory of randomly perturbed 
systems without absorbing states [Kifer, 1988, 1989, 1990] to a general theory of randomly perturbed systems with 
absorbing states. 

For the unperturbed, deterministic dynamics, we consider a bounded continuous map F : M —> M, where M is a 
closed subset of M. d . A random perturbation of F is a family of homogeneous Markov chains {X £ } £> o on M, whose 
transition kernels 

p £ (x, T) = P [X e n+l G T I XI = x] , x G M, T Borel subset of M 
enjoy the following standing hypothesis: 

Standing Hypothesis 1.1 For any 5 > 0, lim e ^o Ps(£) = where 

I3 5 (e) = sup?/ (x,M\N s (F(x))) 

and N S (A) := {x G M : m£ y ^A \\% — y\\ < 6} denotes the 5 -neighborhood of A. 

All Standing Hypotheses are assumed to hold throughout the paper. Standing Hypothesis 1.1 implies that p £ (x, ■) 
converges uniformly to the Dirac measure 5p^ x ) at F(x) for the vague convergence of measures, i.e. for any continuous 
function g : M — ► R with compact support, 



lim sup 

£ -> ieM 



g(y)p e (x,dy) -g(F(x)) 



M 



0. 



Consequently, for small e > 0, the asymptotic behavior of the Markov chain {X 6 l }^' =1 should be related to the 
dynamics of iterating the map F. 

When the perturbed system admits an invariant measure (e.g. the Markov chains are irreducible), the corrcspon- 
dance between the deterministic dynamics and the perturbed counterpart was intially studied by Andronov et al. 
[1933], and more recently by Sinai [1972], Rucllc [1981], Frcidlin and Wentzell [1984] and Kifer [1988, 1989, 1990]. 
Recall that a Borel probability measure fi e on M is called a stationary distribution for p £ if 



f measure pL s on M is called a stationary distribution f 

/ p e (x 7 T)fi e (dx) = jti e (r), for any Borel set T C AL 
Jm 



These invariant measures describe the long-term statistical behavior of the perturbed system. Let us assume, for 
a moment, that for all e > 0, the Markov chain X E admits (at least) one invariant measure /i e . We call pi a 
limiting measure of the family of measure {/x e } e >o if fJ- is the weak* limit of some sequence {a*s„}5^Li, where e n 
decreases to zero. Natural questions about these limiting measures include: Are the limiting measures invariant for 
the deterministic dynamics? If so, what can be said about their support? 

Kifer [1988, 1989, 1990] considered these questions under the assumption that the transition kernels p £ satisfied 
the following large deviation assumption: there exists a continuous, non-negative rate function p such that 

lim e\ogp e (x, U) = — inf p(x,y), (1) 

for any open set U C M and uniformly in x G M. Under suitable assumptions, Kifer proved that limiting measures are 
invariant for F (i.e. p,(T) = /J,(-F _1 (r)) for all Borel set T) and are supported by the attractors of the deterministic 
dynamics. In particular, Kifer's approach allowed him to derive some of Freidlin and Wentzell's results on the 
asymptotic behavior of invariant measures for diffusion processes X s generated by operators of the form L € = eL + b 
where L is a "good" second-order elliptic differential operator and b a vector field [Freidlin and Wentzell, 1984, 
Chapter 6]. 
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While Kifcr's results are applicable to a wide class of stochastic models for the physical sciences, they are not 
applicable to many models in ecology, epidemiology, immunology, and evolution. These stochastic models often have 
absorbing states Mo C M corresponding to the loss of one or more populations that satisfy the following standing 
hypothesis. 

Standing Hypothesis 1.2 The state space M can be written M = Mo U Mi, where 

• Mo is a closed subset of M ; 

• Mo and Mi are positively F -invariant, i.e F(Mo) Q Mo and F{M{) C Mi; 

• the set Mo is assumed to be absorbing for the random perturbations: 

p e (x, Mi) = 0, for all e > 0, x G M . (2) 

For many of these biological models, the set Mq of absorbing states is reached in finite time almost surely for any 
£ > 0. Despite this eventual absorption, the process {X^ l }'^L 1 may spend an exceptionally long period of time in the 
set Mi of transient states provided that e > is sufficiently small. In applications, this "metastable" behavior may 
correspond to long-term persistence of an endemic disease, coexistence of interacting species, or maintenance of a 
genetic polymorphism. One approach to examining metastable behavior are quasi-stationary distributions which are 
invariant distributions when the perturbed process is conditioned on non-absorption. More specifically, 

Definition 1.3 A probability measure [i £ on Mi is a quasi-stationary distribution (QSD) forp e provided there exists 
X £ G (0,1) such that 

/ p e (x, T)fi e (dx) = X e fi s (T) for all Borel T C M x . 

Equivalently, dropping the index e, a QSD \i satisfies the identity 

/i(r) = P M (X n G r | X n G Mi) , Vn, 

where P M denotes the law of the Markov chain {X n }^L , conditional to Xo being distributed according to /i. Quasi- 
stationary distributions can sometimes be defined through the so-called Yaglom limit: 

H(T) = lim (X n G r | X n G Mi) , 

n—t+oo 

when the limit exists and is independent of the initial state x £ Mi. When F(X± G •) = /i(-), A is the probability of not 
being absorbed in the next time step. The existence of QSDs has been studied extensively [Darroch and Seneta, 1965, 
Seneta and Vere- Jones, 1966, Tweedie, 1974, Barbour, 1976, Nummelin and Arjas, 1976, Arjas et al., 1980, Kijima, 
1992, Ferrari et al., 1995, Chan, 1998, Lasserre and Pearce, 2001, Gosselin, 2001, Coolen-Schrijner and van Doom, 
2006, Buckley and Pollett, 2010]. 

Hognas [1997], Klcbaner et al. [1998], Ramanan and Zeitouni [1999] studied weak* limit points fj, of QSDs [i e 
as e — > for maps of the interval i.e. M = [0, 1] and Mo = {0}. Under suitable assumptions, these authors have 
shown that if F admits an attractor in (0, 1), then the limiting measure \i is F- invariant and concentrated on the 
attractors of F in (0, 1). Moreover, \ £ > 1 — e~ c l e for an appropriate constant c > 0. This final assertion implies that 
if the perturbed processes is initiated in the quasi-stationary state, then the expected time to absorption increases 
exponentially with exponent 1/e as e decreases to zero. 

Here, we extend these types of results to higher dimensional systems where M is a subset of R d . The two main 
results of the paper are stated in Section 2. First, we state a general result that ensures that the QSDs concentrate 
on attractors of F restricted to Mi . This result requires conditions on the topological dynamics and the rate at which 
j3&{e) in Standing Hypothesis 1.1 goes to zero. Second, for many applications, the randomly perturbed Markov chains 
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satisfy large deviation assumptions. We present a result that guarantees the conditions of the general theorem are 
satisfied. Proofs of these two results are presented in Sections 3 and 4, respectively. In Section 3, we also show how 
the main result of Klebaner et al. [1998] can be derived from our general theorem. In Section 5, we apply our results 
to stochastic models of metapopulation dynamics, competing species, host-parasitoid interactions, and evolutionary 
games. In Section 6, we conclude by verifying the large deviation assumptions for the examples in Section 5. 

2. Statement of the main results 

Let {X e } e> o be a family of Markov chains on a closed set M C R d , which satisfies Standing Hypothesis 1.2. Since 
M is assumed to be a closed subset of R d , every topological concept must be understood in terms of the topology 
induced in M. In particular, in the following, a compact set K will always be a closed (in M) bounded set K C M. 

We assume that, for each e > 0, there exists at least one QSD \i e : there exists 1 > A e > such that \ E \i e = Q £ ' a e 
where Q £ is the operator defined on the set of finite Borel measures on Mi by 



Our main results concern characterizing the support of weak* limit points fj, of the /i e as e 1 0. Under suitable 
assumptions, we show that these weak* limit points are supported by attractors of the map F that lie in Mi (see 
2.1 for a definition of an attractor). In Section 2.1, we describe sufficient conditions for this result with suitable 
assumptions about the topological dynamics of F and /3a (e) in Standing Hypothesis 1.1 goes to zero. In Section 2.3, 
we describe large deviation assumptions which satisfy the assumptions presented Section 2.1 and which are easier to 
verify for applications presented in Section 5. 

2.1. Absorption-preserving chain recurrence and convergence to attractors 

We begin by recalling a few definitions from dynamical system theory. Let F n be the n-iterate of F. A set B C M is 
invariant for F if F(B) — B. A compact set A is an attractor for F provided there exists an open neighborhood U of 
A such that n n >iF n (U) = A and, for any open set V D A, there exists n(V) such that F n {U) C V for all n > n(V) 
The key notions needed for our main result is absorption preserving pseudoorbits and chain recurrence introduced in 
Jacobs and Schreiber [2006]. These definitions generalize Conley [1978] 's notion of pseudoorbits and chain recurrence. 
Given S > 0, a family of points £ = (£o, • • • , € M n+1 is called an absorption preserving S-pseudoorbit joining x to 
y (ap (5-pseudoorbit for short) provided that 

a) % = f , V = £n, 

b) & 6 M => 6+i e M , and 

c) d(&+i,F(&)) <5, i = 0,...,n-l. 

One can think of ap <5-pscudoorbits as approximations of actual orbits of the dynamics of F with an error no greater 
than S and that preserve the absorbing set Mo. For readers unfamiliar with these concepts, consider F to be the 
identity map on the interval [0, 1]. Then any two points on the interval arc connected by <5-pscudoorbits, for any 
5 > 0. However, as every point is a fixed point, none of the points are connected by interating the map F. 

Given x, y € M, we say that x ap-chains to y and write x < ap y if for any S > there exists an ap (5-pscudoorbit 
joining x to y. Notice that no point in Mo ap-chains to any point in M\. If x < ap y and y < ap x we shall write 
x ^ ap y. If x ^ ap x, then x is an ap-chain recurrent point. The set TZ ap of ap-chain recurrent points is F-invariant.The 
relation ~ aJ ,, restricted to this set defines an equivalence relation. The equivalent classes, [x] ap with x £ 7t api are 




p e {x,T)n{dx) for every Borel Y C M\. 
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called ap-basic classes. In Section 3.1, we prove various properties of these equivalence classes e.g. u>(x) C TZ a p 
whenever uj(x) C M or uj(x) C M\. 

Let [x] ap and [y] a p be two distinct ap-basic classes. We write [x] ap < a p [y]ap if x < ap y. A maximal basic class 
[x] ap (i.e. [x] ap < a p [y}ap implies that [x} ap = \y]ap) is called an ap-quasiattractor. In general, an ap-quasiattractor 
need not be an attractor for F. A simple example is an increasing function F : [0, 1] — > [0, 1] with F(x) = x for 
x = 1 — 1/n for all natural numbers n and F(x) ^ x otherwise. If Mo = {0}, then x = 1 is a quasi-attractor but not 
an attractor for F. 

We need three hypotheses in order to state the first main result. The first hypothesis requires that there are a 
finite number of ap-basic classes including at least one ap-quasiattractor in Mi. This assumption is satisfied for 
many important classes of mappings including gradient-like systems and Axiom A systems. When this hypothesis is 
satisfied, we prove in Section 3.2 that all the ap-quasiattractors are in fact attractors. 

Hypothesis 2.1 There exists only a finite number of ap-basic classes in Mi : { If; }j=i ,...,,). Moreover, we assume 
that they are closed sets and {-fQ}i=i,....£ with £ > 1 are the ap-quasiattractors and {-fQli^+i,...,^ are the non ap- 
quasiattractors. 

Our second hypothesis ensures the time spent near non ap-quasiattractors is not too long relative the Ps{e) 
described in Standing Hypothesis 1.1. For any Borel set V, we define the first passage time t v = min{n, : ^ V}. 

Hypothesis 2.2 Given any 5 > 0, there exist neighborhoods Vi C N s (Ki) of Ki for I + 1 < i < v and quantity 
5i G (0, 1) such that 

Ty. > h(e) < ((e) and lim ((e) = 



sup . 

xGVj 



for a function h satisfying 



lim h(e)p Sl (e) = 0. 



Our final hypothesis provides a lower bound on the probability of absorption on the event is sufficiently close 
to M . 

Hypothesis 2.3 There exists a neighborhood Vq of Mo such that 

lim- ^# , M= 0. 
e^o mi x(i v a P £ (x, Mo) 

We prove in section 3 that, if Mo is a global attractor, then /j, is supported by Mo (see Theorem 3.12). The main 
result of this section is the following theorem. A proof is given in Section 3. 

Theorem 2.4 Assume that Hypotheses 2.1 and 2.2 hold. Then any weak* limit point /i o/{/i e } e >o satifies = 

for j = I + 1, v. In addition, if Hypothesis 2.3 holds, then /i is supported by the union of the attractors U| =1 ifj. 
Moreover, there exists a 5 > such that X e > 1 — /3s (e) for all e > sufficiently small. 



2.2. Large deviation hypotheses 

For applications, it is often easier to verify certain large deviation hypotheses rather than Hypotheses 2.2 and 2.3. 
To this end we consider the following large deviation hypothesis. 

Hypothesis 2.5 There exists a function p : M x M — > [0, +oo] such that 

(i) p is continuous on Mi x M , 
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(ii) p(x, y) = if and only if y = F(x), 
(Hi) for any j3 > 0, 

inf {p{x, y): x € M, y 6 M, d(F(x), y) > /?} > (3) 

where d(x,y) = max, |xj — 
(to) for any open set U , we have the large deviations lower bound 

liminf elogp e (x, U) > — inf p(x,y) (4) 

i/iai ZioZrfs uniformly for x in compact subsets of Mi whenever U is an open ball in M. Additionally, for any 
closed set C , we have the uniform upper bound 

limsup sup e\ogp £ (x,C) < — inf p(x,y). (5) 

e^O x£AI V^ c 

Equations (3) and (5) imply that Standing Hypothesis 1.1 holds. Additionally, since Mo is absorbing, (4) implies 
that p(x,y) = +oo for all x S Mo, y 6 M\. The upper bound (5) can be weakened as a uniform bound on compact 
subsets of Mi. In that case, Hypothesis 1.1 is no longer implied by Hypothesis 2.5. 

We also make the following assumption that ensures absorption is reasonably likely when the process is near the 
absorbing states. 

Hypothesis 2.6 For any c > 0, there exists an open neighborhood Vq of Mq such that 

lim inf e logp e (x, M ) > -c. (6) 

e— >0 xGVb 

To state our main result under these large deviation assumptions, we need to introduce an alternative notion of 
chain recurrence. Given n eW = {0, 1,2,3,... }, define the function A n on M n+1 = M x ■ ■ ■ x M by 

n + 1 times 

n-i 

£ = . • . ,M ^ MO = E pfo>M- 

A n measures the "cost" of Xf t following the partial trajectory £ where the cost is measured in terms of how much 
"noise" is required to move along this partial trajectory. For any x, y in M , we define 

B p (x, y) = inf {MO | n > 1, f G M n+1 , ^ - x, £ n = y) . 

The function B p (x,y) represents the minimal cost in going from x to y. B p induces a partial order on M by writing 
x < p y (i.e. u x p-chains to y") if B p (x,y) = 0. Roughly, x p-chains to y if there exist paths joining x to y with 
arbitrarily low costs. If x < p y and y < p x, we write x ~ p y. 

We define the set of p-chain recurrent points 1Z P to be the set of points x E M such that x ^ p x. The p-basic 
classes are the equivalence classes for ~ p restricted to the p-chain recurrent set. Since a point in Mq never p-chains 
to a point in Mi, the p-basic classes are included either in Mq or in Mi. In general, the p-basic classes and the 
ap-basic classes introduced in Section 2.1 need not be equivalent. For example, consider F : [0, 1] — > [0, 1] given by 
the identity map F(x) = x for all x and M = 0. Let p(x,y) — \x — y\. Then every point {x} is a p-basic class but 
the only ap-basic class is [0, 1]. However, unlike this example, if there are a finite number of p-basic classes, then we 
prove in Section 4 (see Theorem 4.12) that the ap-basic classes and p-basic classes agree. 

imsart-generic ver. 2011/11/15 file: QSD_arxiv_nov2013.tex date: November 22, 2012 



/QSD for random perturbations 



7 



Given a p-chain recurrent point x, let [x] p denote its p-basic class. We say that [x] p < p [y] p if x < p y and call p- 
quasiattractors the maximal p-equivalence classes. When a p-quasiattractor A is isolated (i.e. there is a neighborhood 
of the quasi-attractor containing no other p-chain recurrent point), we prove in Section 4 that A is an attractor for 
F (see Proposition 4.6). 

In Section 4, we use Theorem 2.4 to prove the following result. Applications of Theorem 2.7 are given in Section 

5. 

Theorem 2.7 Assume that Hypotheses 2.5 and 2.6 hold and that there exists a finite number of p-basic classes in 
Mi, which are closed. If 

• there is at least one p-quasiattr actor A among the p-basic classes in Mi, and 

• p, £ {U) > for any neighborhood U of A and e > 0, 

then any weak*-limit point of {p £ }e>o * s F -invariant and is supported by the union of p-quasiattractors in Mi. 
Moreover, there exists c > such that X £ > 1 — e~ c / £ for all e > 0. 

Remark 2.8 Assume that there is a finite number of closed non-quasiattr actors [x\] p , [xjvjp in Mi and A = 
(1Z P n Mi) \ U^fi\xi\ p is an attractor for F. Then the main result still holds: if p £ (U) > for any neighborhood U 
of A and e > 0, then any weak*-limit point of {p e }e>o « s F -invariant and is supported by A. Moreover, there exists 
C > such that A E > 1 - e" c/e for all £ > 0. 

3. Proof of Theorem 2.4 

In this section, we prove Theorem 2.4. We begin by proving several key results about ap-chain recurrence in Section 
3.1 and 3.2. In Section 3.3, we prove some key properties of limiting quasi-stationary distributions. A proof of 
Theorem 2.4 is given in Section 3.4. In Section 3.5, we show how our proof of Theorem 2.4 provides an alternative 
proof of the main result of Klcbancr ct al. [1998]. In addition to the Standing Hypotheses, the results in Section 3.2 
require Hypothesis 2.1, and the proofs in Sections 3.4 and 3.5 require Hypotheses 2.1, 2.2, and 2.3. 

3.1. Absorption preserving chain recurrence 

We recall a few definitions and facts from dynamical systems. The uj-limit set of B C M is given by uj(B) = 
r\ n >iUp> n FP(B). It is the maximal invariant set in the closure of U n >oF n (B) . An equivalent definition of an attractor 
presented in Section 2.1 is that a compact set A is an attractor for F provided it admits an open neighborhood U 
such that oj{U) = A; the open set {x € M : oj(x) C A} is then called the basin of attraction of A. By a classical 
result (see Conley [1978]), a compact set A is an attractor for F if and only if there exists an open set V which 
contains A and such that 

F(V)czV, n nen F n (V) = A. (7) 

Our assumption that \\F\\ = sup TGA/ \\F (x)|| < oo implies that the set lZ ap of ap-chain recurrent points is included 
in aII f "(0). The relation ^ ap , restricted to this set defines an equivalence relation. Unlike the ap-basic classes lying 
in Mo, the ap-basic classes lying in Mi may not be closed. However, we have the following: 

Lemma 3.1 Let x be an ap-chain recurrent point in Mi. Then [x] ap C Mo U [x] ap . In particular, 

[x] ap C Mi => [x]a P closed. 
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Proof. Let y £ [x] ap . There exists a sequence {yk} in [x] ap which converges to y. Any ap i5-pscudoorbit from x to yk 
is an ap 2(5-pseudoorbit from x to y, provided k is chosen large enough. Hence x < ap y. On the other hand, assume 
that y ^ Mo and consider an ap <5-pseudoorbit (£o, •■•,£») chaining yk to x. We have 

d(F(y)^ 1 )<^ + d( J F(y),F( 2 / fc ))<25, 

by continuity of F provided k is large enough. Consequently, (y, £1,... , £ n ) is an ap 25-pseudoorbit chaining y to a; 
and y € [x] ap . ■ 

The following lemma shows that ap-basic classes are invariant. 

Lemma 3.2 Any ap-basic class [x] ap is positively F-invariant: F([x] ap ) Q [x] ap . If [x] ap C M± (which implies that 
[x] ap is closed), it is F-invariant: F([x] ap ) = [x] ap . 

Proof. If [x] a p is a singleton, then F(x) = x and there is nothing to prove. Assume that there exists y =^ x such that 
y £ [x] ap . For any 5 > 0, continuity and boundedness of F(M) implies that there exists a 5/2 > 5' > such that 

d(z,F(x)) < <5'=> d(F(z),F 2 (x)) < 5/2 for all 2 e M. 

Pick an ap i5'-pseudoorbit (x = £0, £ij • • • > =2/) joining x to y. Since F(x)) < 5' 

d(F 2 (x)^ 2 ) < d(F 2 (x),F(^)) + d(F(^)^ 2 ) < 5/2 + 5' < 5, 

and (F(x),^2, • ■ • is an ap <5-pseudoorbit joining F(x) to y. Hence, F([x] ap ) Q [x] ap - 

Next, let us assume that [x] ap is closed in M\. For every y £ [x] a p, we need to prove that y = F(y') for some 
y' € [x] Qp . For any 5 > 0, choose an ap <5-pseudoorbit (£f )j=o,...,n((5) joining y to itself. Now choose a compact set 
if C Afi containing an open neighborhood of [x] ap . We prove in the next section (see Remark 3.7) that the families 
£ s can be chosen in such a way that they are contained in K . In particular the family £,^^_i admits an accumulation 
point y' as 5 — > 0. By continuity of F, F(y') = y and, therefore, y' ~ ap y ^ ap x. ■ 

For classical chain recurrence, o>(x) is contained in the chain recurrent set. While ap-chain recurrence shares this 
property whenever w(x) C Mo or oj(x) C Mi, in general it only satisfies a weaker property. 

Lemma 3.3 For x E M, uj(x) n H ap ^ 0. 

Proof. If x 6 Mo or tu(x) C Mi, then the classical result for chain recurrence implies uj{x) C 7Z ap . Suppose x € Mi 
and y £ u>(x) H Mq. Then w(y) C !Z a p- Since C lj(x), the result follows. ■ 

Lemma 3.4 If [x] ap is maximal then x < ap z if and only if z 6 [x] ap . In particular, any ap-quasiattr actor [x] ap is 
compact. 

Proof. Let z be such that x < ap z. To prove that z € [x] ap , we need to show that z < ap x. By Lemma 3.3, 
lo(z) n TZ ap ^ 0. Hence there exists z' € D TZ ap . In particular, x < ap z < ap z' . As z' G 1Z a p, maximality of [x] ap 
implies that z' € [x] ap . Thus z < ap x. In particular, if y S [x] ap , then the proof of Lemma 4.1 implies that y > ap x. 
Hence, y £ [x] ap and [x] ap is closed. ■ 

The next result is an easy consequence of Proposition 4.2 in Kifer [1988]. A closed ap basic set is said to be isolated 
in Mi if it admits an open neighborhood which is disjoint from any other ap basic class 

Theorem 3.5 Let [x] ap be an isolated ap-quasiattr actor in M\. Then [x] ap is an attractor. 
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3.2. Finiteness of the ap-basic classes 

Throughout this subsection, we require Hypothesis 2.1. Namely, there exists a finite number of ap-basic classes 
{Ki}i=i where the Ki are closed sets, {Ki}f =1 are ap-quasiattractors and {Ki}^ =e+1 are non ap-quasiattractors. 

The following result is proved in [Kifer, 1988, p. 217-8] for classical chain recurrence. We give a proof adapted to 
our settings for the convenience of the reader. 

Lemma 3.6 a) For any 9 > sufficiently small, there exists a quantity < 5(9) < 9 such that, if there is an ap 
S(9)-pseudoorbit (£oj • • • j 6n) satisfying 

d(£o,Ki) < 5(9), d(Cj,Ki) > 9 and d({, n , K v ) < 5(9) for some e {l,...,v}, j e {l,...,n} (8) 

then i ^ i' and Ky > ap Ki. 

b) For any 5' > 0, there exist 5 £ (0, 5') and no > 1 such that any ap 5-pseudoorbit of length greater than no must 
pass through N s ' (Tl ap ). 

Proof. Assume that, for any 5 > 0, there exists an ap (5-pseudoorbit (£q, . . . such that 

d(£ , Ki) < 5 and d(£ (a) , K v ) < 5. (9) 

Then there exists 5k i 0, y <E Ki and y' 6 K v such that lim^oo ^ = y and lim^oo = J/'- Hence d(F(y), £( h ) < 

5 k + d(F(y),F(&)) and d(F(^ {Sk) _ 1 ),y') < 5 k + d(^ Sh) ,y f ). Therefore, for any 5 > 0, (y,£*V . is 
an ap (5-pseudoorbit provided that k is large enough. This proves that K%> > ap Ki. As a consequence, if Kv > ap Ki 
docs not hold, this means that there exists some quantity 5 > such that, for any < 5 < 5, we cant have (9). Now 
pick 9 > smaller than 5. 

Now assume that i = i'. Choose 9 small enough such that N e (K{) C M\. Assume that there exist a decreasing 
sequence 5k I and ap <5fc-pseudoorbits (£q , . . . , ) such that (8) holds with £ = £ fe , <5 = <5fc and j = jfc. Without loss 
of generality, we may assume that limfc_ ) . 00 £q = x e Ki, lim^oo £* k = y £ K \ N e (Ki) and limfc^oo £^ = z e 
where if C M x is a compact set such that F(N e (K l )) C if. We then have d(tf,F(x)) < 5 k + d(F($), F(x)), 
d(y,F(^ k _ 1 )) < d(y,$ k ) + 5 k , d(F(y),$ k+1 ) < d(F(y),F($J) + 5 k , and d(z,F(^ k _ 1 )) < d(y,$J + 5 k . By 

continuity of F, this implies that, for any 5 > 0, the sequence (x, , . . . , 2/) • ■ • > £n fe -ii z ) ^ s an a P ^ - 

pseudoorbit, provided fc is large enough. Consequently, x < ap y < ap z contradicting the fact that Ki is an ap-basic 
class. 

We now prove point b). For any x £ M and 7 > 0, Lemma 3.3 implies that the quantity 

n<(x) := inf {n e N : G N^(TZ ap )} 

is finite. By continuity of F, n 7 is uppcr-semicontinuous. Compactness of F(M) and upper scmicontinuity implies 
that n 1 :— max ie i/ n J (x) < nrax^ e J ^ A/ ^ n 1 (y) + 1 < 00. 

Now assume that there exists 5' > such that the statement of b) is not true. In particular, for each k there 
exists an ap 5 k = (5'//c-pseudoorbit of length n s / 2 , £ fe = (£q, . . . ,£ k 5 , /2 ), which does not enter N s (TZ ap ). Passing 
to a subsequence if necessary, we may assume that lim^oo £| = £j e M for any j = 1, n s / 2 . The sequence £ 

is a partial solution of the discrete dynamical system induced by F i.e. F(£i) = for i = 0, . . . , n s / 2 — 1. The 
definition of n s / 2 implies that there exists jo such that d(£j ,1Z ap ) < <5'/2. Hence, £* o <G N s (TZ ap ) for k large enough 
which contradicts the choice of £ fc . ■ 
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Remark 3.7 Notice that, even without the finiteness assumption, the following statement still holds: given an ap- 
basic class [x] ap in Mi and 8 > 0, there exists a quantity S > such that any ap 5-pseudoorbit joining [x] ap to itself 
remains into N ([x] ap ). 

Corollary 3.8 Given 5' > ; there exist isolating open neighborhoods {lt}»=i,...,« of the ap-basic classes {-Kj}j=i,. .., v , 
and positive constants S\ and no such that 

a) N Sl (I<i) c Vi for 1 < i < v, 

b) Any ap 8\-pseudoorbit starting in Vi remains in Vi for i = 1, . . . ,£. 

c) // there exists an ap S\-pseudoorbit (£q, . . . ,£„) such that £o £ N Sl (Ki), £ N Sl (Ki') and ^ Vi for some 
2 < k < n — 1 , then i ^ i' and Ki> > Ki . 

d) Any ap 5\-pseudoorbit of length greater than no must pass through N s (1Z ap ). 

Proof. Choose 9 £ (0, S') sufficiently small so that Lemma 3.6a holds and let 5(8) > be as given by Lemma 3.6a. 
Choose neighborhoods Vi of Ki such that N 9 (Ki) C Vi for i = 1, . . . , v and F(Vi) C Vi for i = 1, . . . , k. The latter 
choice is possible as Lemma 3.1 implies that the ap-basic sets Ki are compact for i = 1, . . . , v and Theorem 3.5 implies 
that Ki is an attractor for i = 1, . . . , I. Choose b~\ £ (0, S(8)) such that Si is less than the S given by Lemma 3.6b 
and such that any ap <5i-pscudoorbit starting in Vi for i = !,...,£ remains in Vi. This latter choice is possible as 



3.3. Limit behavior of quasi- stationary distributions 

Throughout this section, we assume that there exists a decreasing sequence e n ! such that, for every n £ N, 
a n is a quasi stationary probability measure for p £n with associated eigenvalue A„. Additionally, we assume that 
fj, n converges weakly to a Borcl probability measure fi. We note that the results in this subsection do not require 
Hypothesis 2.1 or Hypothesis 2.3. Recall from Standing Hypothesis 1.1 that /3a (e) = sup xeM p e (x, M \ N 5 (F(x))) . 

Lemma 3.9 We have the following: 

a) liminf n _ ) . 00 A rl > fj,(Mi). In particular, if fi is supported by M\, then lim„_ s . 00 A n = 1. Alternatively, if 
linin-xx, A n = then [i is supported by Mq. 

b) If there exists an attractor A C Mi such that u n (U) > for every n and every open neighborhood U of A, then 
there exists 5 > such that 



F(V) C Vi for i = !,...,£. 



V 




it 




^oo inf xeV o p £n (x, Mq) 



0. 



then /i(Vo) = 0. 

Proof, a) Let (6k)k be a positive sequence, decreasing to zero and define 

V k := {x £ M x : d[x, M ) > 6 k } , U k := F-\V k ). 
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Notice that {Uk)k is an increasing sequence of open sets such that U k C Mi (by _F-invariance of M ) and Ufc(t/fc) = Mi 
(by closedness of Mo). We have 



A„ > / ^ n (dx)p €n (x,Mi) 

> Li n {U k ) inf p e »(x,Mi) 
left 

= Hn(U k )(l - sup M )). 

test/* 

Since F(£4) C Vfc, we have N s "{F(U k )) C Mi. Thus 

Xn > »n(U k ){l - sup p e "(x,N 5 «(F(x)) c )). 
xeUk 

By weak* convergence, the definition of X n , and Standing Hypothesis 1.1. 

liminf A„ > liminf fi n (U k ) > n{U k ) 

n n 

for all k. Point a) follows since \im k n(U k ) = fJ.(Mi). 

b) Choose an open neig hborhood U of A such that F(U) C U and S > such that N S (F(U)) C J7. We have 

{U) > n n (U)(l - supp e -(x,U c )). 

Since p e "{x, U c ) < p £ "(x, (N 5 (F(U))) C ), and fi n {U) > 0, we get that A„ > 1 - p s (e n ). 
b'). By assumption, we have 

I -Men) < Xn 

(1 -p En (x, M ))ii n {dx) 

M 

< fx n (M \ V ) + Mn(Vb) f 1 - inf p E -(x, M ) 

which gives 

„ OM < 



mi xeVo p £ "(x,M )' 

Since Vq is open and lim,,-^ fi n = fi in the weak* topology, the result follows. ■ 

Remark 3.10 Notice that we actually have a better result, as the quantity f3s(s n ) could be replaced by the smallest 
quantity 

sup p^(x ,(N S (F(UW) 

Proposition 3.11 // lim, woo A„ = 1, then the probability measure ft is F -invariant. In particular, [i is supported 
by the closure oflZ ap - 

Proof. It suffices to verify that 



g(x)ti(dx) = / g(F(x))fj,(dx) (10) 

M JM 
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for any bounded continuous function g : M — > K. Uniform continuity of g on AT"- F '" + ' 5 (0) and Hypothesis 1.1 imply 



lim sup 



(g(y)-g(F(x)))pF-(x,dy) 



0. 



Therefore, 



/< 

Jm 



(g(x) - g(F{x)))n n (dx) 



An / g(y)p En (x,dy) - g(F(x))) n n (dx) 

M \ JM 



< 2(1 -A, 

< 2(1 -A, 



M \J M 



(g{y) - g(F{x)))p e "(x,dy) ^ n {dx 



sup 



(g(y) - g{F(x)))p en {x,dy) 



M 



Sending n to infinity implies (10) for any continuous bounded g. Hence, jj, is ^-invariant. _F-invariance of ft implies that 
the support of ijl is contained in the Birkhoff center of F, i.e. the closure of recurrent points of F: {x € M : ig oj(x)}, 
which is in turn included in the closure of lZ ap . I 

The following theorem provides a sufficient condition for the support of the limiting measure ft to lie on the 
absorbing set Mo. 

Theorem 3.12 Assume that Mq is a global attractor. Then ft is supported by Mq. 

Proof. If liminf Il _ i . 0O A rl = 0, Lemma 3.9 implies that [x(Mq) = 1. Assume that liminfn^oo A„ > and let c = 
inf „ A n > 0. Given a > 0, pick an open neighborhood U of Mq, 8\ > and no s N such that U C iV Q (Mo), 
F(U) C U, any ap <5i-pseudoorbit starting in U remains in U and any ap 5i-pseudoorbit of length at least no 
eventually enters U (see Corollary 3.8). 

Let £ nt k be the event \{Xj n )j=a l ... l k is an ap i5i-pseudoorbit }. Since a 5i-pseudoorbit of length at least no ends 
in U, we have 



< P x [S^ k ] + P [£ n , k and X{~ e U c ] 
fc-i 

< y £F w (d(Xfr 1 ,F(Xp))>8 1 )+0 

3=0 

< kf3 Sl (e n ) 

for k > no and x € M. The last inequality follows from the definition of f3s(e) and the second inequality from the 
fact that the event £^ k is included in the union of the k events {d (X?^, F(Xj n )) > Si}. By the definition of /i„, 



An JM 



fl n (dx)V x [X% 6 U C ] < 



n f3 Sl (£n) 



c n 



and the last quantity goes to zero as n goes to infinity. Since a > was arbitrary, ji{Mx) = 0. I 

Remark 3.13 The proof is not needed in the particular case where X n goes to one since fi is then F -invariant and 
the Birkhoff center is contained in Mq. 
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3.4. Proof of Theorem 2-4 

We assume Hypotheses 2.1, 2.2, and 2.3 hold. Recall that, under the finiteness assumption, the ap-quasiattractors 
{-Kj}i=i,...,f are actually attractors (see Theorem 3.5). Also, there exists 5q > such that A n > 1 — Ps {s n ) 
(see Lemma 3.9b). Let {Vi}i—i... v and Si < 5q be chosen as in Corollary 3.8. Given a Borel set V define t v = 
inf{j>0: Xf£V}. 

Call b = v — i the number of non ap-quasiattractors in M\ and K = U^ =1 Ki. Choose sequences {m T i}ri>i and 
{ m ' n }n>i such that 

TTl /l(^ n ) 

lim /3s 1 (£n)m n = 0, lim — — = and lim — = 0. 

n— >oo n-»oo m n n-toc m 

The presence of an attractor inside M\ such that \x n {U) > for any n and any open neighborhood U implies that 
lim rl _ ) . 00 A„ = 1, by Lemma 3.9b. Proposition 3.11 implies that fi is F-invariant and supported by the closure of lZ a p- 
Let us prove the first statement of Theorem 2.4. Let j G {£ + 1, . . . , v} be fixed. By definition of A„, 

Mn(^) = 4r / ft n {da)F x [X^ G Vj] , Vr G N*. 

For i = {1, . . . , &}, call t % n the integer [?ri n /ij ■ Let £ n and £^ be the events respectively defined by 

£ n = {(^")i=i,..., m „ is a <5i-pseudoorbit} 

and 

^ = {Vi G {l + 1, . . . , v}, Vq > m' n , XI- G N 5 > (*Q => g iV* 1 (K t )} . 

The set £^ is the event "after its first entry in any N Sl (Ki), the Markov chain will have escaped from this set after 
m' n steps and will never come back." 

On the event £ n C\£' n , the process (Xf n , . . .X^ ) is an ap <5i-pseudoorbit and therefore gets trapped in uf =1 Vi 
if it enters in this set. Corollary 3.8 implies that it cannot spend more than b blocks of length at most m' n in 
UV =e+1 N Sl (Ki). In particular, for n large enough, is in Vj only if X% G (N Sl (K)) c for some i G {1, ... ,b}. 

Therefore, 

fa 

p x [{x% n e Vj} pi £ n n £' n ] < Px [x e t r £ N s ^ (K) . 

i=l 

On the other hand on the event £ n , starting from N Sl (Ki), the chain cannot enter back into N Sl (Ki) once it 
exited Vi (by Corollary 4.9c). Hypothesis 2.2 implies 

V 

P x [(£' n ) c n £ n ] < V sup P y [t% > m' n ] < b((e n ), 
as m' n > h(e n ) for n sufficiently large. Consequently, 

fa 

p x [x% n g Vj] < p, l(£ n ) c ] + p, l(£' n ) c n £ n ] + £ Px [x;r i n Si (k) 

1=1 

b 

< m n f3 Sl (e„) + b({e n ) + £ P, [x£ £ iV* 1 (#) 

i=l 
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for n sufficiently large. Therefore we have, using the invariance properties of 

J H n (dx)P x [X% n e V 3 ] < m n J3 Sl (e n ) + K{e n ) J ^(dx)P x $ N Sl (K) 



i=l ' 
b 



< m n (3 Sl (e n ) + b((e n )+Y,>^Vn((N s ^K)) c ) 

i=l 

< m n /3 5l (e„) + b({e n ) + bfi n {(N 5 ' (K)) c ) 
which converges to as n —> oo. By our choice of the sequence m n , 

liminf A™" > lim inf (1 - fe(£ n )) m " = f. 

n— >oo n— ¥00 

Hence, linin^oc A™" = 1 and 

lim ^(Vj) = lim --L. f fx n (dx)F x [X% e Vj] = 0. 

The proof of the first statement is complete since fJ-(Vj) < liminf n fj, n (Vj). 

Now, under Hypothesis 2.3, Lemma 3.9 implies that the support of /1 is contained in M \Vq. In particular, 
fj.(lZ ap H Mi) = 1. Hence, fi(U^ =1 Kj) = 1 and the second statement of Theorem 2.4 follows. 



3.5. A derivation of Theorem 3 of Klebaner et al. [1998] 

We assume Hypotheses 2.1, 2.2, and 2.3 hold. We can obtain Theorem 3 of Klebaner et al. [1998] as a consequence 
of our proof of Theorem 2.4. To see why, we describe how their assumptions (A1)-(A6) imply our main assumptions. 
For the sake of brevity, we do not state their assumptions here. Rather we refer the interested reader to their 
article. Under their assumptions (A1)-(A6) , Klebaner et al. [1998] prove nonconvergence to the finite set of unstable 
equilibria for one-dimensional maps. Their assumptions (Al) and (A2) guarantee the following, calling Mq = {0, 1} 
to fit our settings 1 , we have : 

a) /(Mo)cMq and /(Mi) c M x ; 

b) the absorbing state {0}, the unstable equilibria (a;*)i=o ... fej and the stable equilibria (s^)j = i j form a Morse 
decomposition for the dynamical system induced by /; hence there is a finite number of ap-basic classes (see 
Proposition 5.1) and the ap-quasiattractors in Mi are the (sj)i = i ; ... ; /. Our Hypothesis 2.1 is verified. 

By their assumption (A4), we derive the uniform (in a;) upper bound 

3A > 0,C > such that P s (x, N s (f{x) c ) < Ce- x ° s/s , 

for any 5 > 0, e > 0. Hence our Standing Hypothesis 1.1 is satisfied, with fis(e) — Ce _Ao<5 / £ . 

There remains to check Hypothesis 2.2. The nonattractors in this case are the unstable equilibria (x*)j = o f.. Let 

61 be the positive parameter associated with the stable points (sj)j = i ; in our Corollary 3.8. By their assumption 

(A5), there exists /3 > such that we have, for j = 1, . . . , k, e and S small enough, 

inf (£ e (x) >e)>(3, inf (f(x) < -e) > f3, 



1 in the quoted paper, Mi = [0, 1] and Mo is [0, l] c but it does not change the problem as they consider continuous state space, see 
Ramanan and Zeitouni [1999] 
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where Vj = N s (x*). Call a n = 8/e n and assume without loss of generality that it is an integer. Using the fact that 
x* is an unstable equilibrium, for n large enough, 

inf P x {\X £ a " -x*\ >S) >(3 a ". 
Now choose < a < XqSi. We have, by Markov property, 



sup . 



where v n = 6 " e / " . By construction, e a ^ 6n (3s 1 (e n ) goes to zero as n goes to infinity. Additionally 

p p a/e np 6log(l3)/e„ 

^ +oo cxp < 

This quantity vanishes as n goes to infinity if we choose 8 small enough (more precisely, 8 must be chosen smaller 
than —a/ log/3). Therefore, we have verified Hypothesis 2.2 and we can apply our result to conclude that the support 
of weak* limit points of the QSDs do not include the unstable equilibria x*. 

4. Properties of p-basic sets and proof of Theorem 2.7 

In this section, we assume that the assumptions of Theorem 2.7 hold. We prove that Hypotheses 2.5, 2.6, and finiteness 
of the p-basic classes imply Hypotheses 2.1, 2.2 and 2.3. From these implications, it follows that Theorem 2.4, Lemma 
3.9 and Proposition 3.11 imply that Theorem 2.7 holds. Indeed, by Lemma 3.9, we obtain the lower bound for A e 
and, by Proposition 3.11, that any weak*-limit point of {fi e }e>o is -F-invariant. The fact that these limiting measures 
sit on p-quasiattractors follows from Theorem 2.4. 

Lemma 4.1 Hypotheses 2.5 and 2.6 imply Hypothesis 2.3. 

Proof. Let 8q > be given. Assertion (iii) of Hypothesis 2.5 implies that 

c = - in£{p(x, y):x,y€ M, d(F(x), y) > 8 Q } > 0. 

It follows from the definition of (3$ and inequality (5) that Ps ( £ ) ^ exp(— 3c/e) for e > sufficiently small. Hy- 
pothesis 2.6 implies that there exists an open neighborhood Vq of Mo such that p e (x, Mo) > exp(— 2c/e) for e > 
sufficiently small and x £ Vq. Hence, 

lim — < lim cxp(— c/e) = 0. 

e^inf aeVbJ >*(x,Mo) " e-K) 

■ 

For the remaining implications, we need to gain some insights about the relation between ap and p-chain recurrence. 
As their ap counterparts, the p-basic classes whose closure is in Mi are actually closed. This follows from the next 
two lemmas. We consider the quantity 

a* = sup inf {p(x,y) : x G M, y £ M, d(F(x),y) > 8} e (0, +oo]. 
5>o 
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Lemma 4.2 For any < a < a* , there exists S > such that, for any £ = (£o, • ■ • ,£n) satisfying A n (£) < a, we 
have < + 6 for i = 1, . . . ,n. In particular, 1Z P is bounded. 

Proof. Given < a < a*, there exists 5 > such that 

p{x,y) <a=> d(F(x),y) < 5. 

Therefore, if A n (^) < a, then < a for i = 0, . . . , n — 1, which implies that d(£i+i, Ffa)) < S for i — 

0,...,n-l. ■ 

Lemma 4.3 W^e /iaue i/ie following: 

(i) The function B p is upper semicontinuous on Mi x M , 

(ii) Let n G (0, a*) a?7,d y 6 M . J/ £/ie set {x G M : B p (x,y) < rj} has its closure in Mi then it is closed. 
(Hi) Let x G Mi. Assume that the set {y G M : B p (x, y) < rf\ has its closure in Mi for r\ small enough. Then there 
exists 7/0 such that, for any r/ < rjo, {y G M : B p (x,y) < 77} is closed. 

Proof. Part (i) is proved in [Kifer, 1988, Lemma 5.1, pg. 58]. It relies on the continuity of p on Mi x M. 

For part (ii), let {xk}k>i be a sequence of points in Mi, such that lim.fc_>.oo x k = x G Mi and B p (xk,y) < r\- Pick 
r > such that N r (x) C M x . For any 7 > and any k > 1 there exists £ 7 ' fc = (^ r,fc , . . . , Q' k ) such that ^ = x k , 
= y and A nk (C' k ) < V + 7- By Lemma 4.2, for 7 < a* - 77, there exists (5 > such that \\t%' k \\ < \\F\\ + S. 
Since N r (x) is a compact set contained in Mi and p is continuous on Mi x M, /? is uniformly continuous on 
N r (x) x NW p W+ s (Q). Thus 

lim |p(a;,^7' fc ) ~ p{xk,£,J' k )\ = 0. 

Therefore, 

B p (x,y) < limM(p( X ,ei' k )+p(fi'\& k ) + ---+p(Cti,y)) 
< \imM\p(x,tf< k )-p(x k ,ei' k )\+A nk (e> k ) < '7 + 7 

k— >oo 

Since this holds for any 7 > 0, part (ii) follows. 

Proof of part (Hi) is similar. However we have to be careful since p is not continuous on Mo xM. Let x G Mi be given 
and assume that there exists rj > such that K = {y G M : B p (x,y) < 77} C Mi. Define a = d(M rW l|F|1 (0), K) > 0. 
Since p is continuous on Mi x M and p(z, y) = if and only if y = F(z), there exists a > such that 

p(z, y) < a =>■ d(F(z),y) < min(a/2, 1) 

for all z, y G M. Let 770 = min(a, rj) and choose < 77 < 770- We claim that {y G M : B p (x, y) < 7/} is closed. To see 
why, let {yk}k>i be a sequence in Mi such that lim/ ; _ i . 00 y k = y G Mi and B p (x,yk) < r\ for all k. For any 7 > 0, 
there exists a family £ 7 ' fc = (£j' k , . . .,Q' k ) such that Q' = x, Q' k = y k and A nk (^" t ' k ) < n + 7. For 7 < 770 - 77, 
p(£nL-i> J/fe) < Therefore d(F(^' k -i),yk) < min(a/2, 1), which implies that d(F(^' _ x ), Mo) > a/2. By continuity 
of _F and f-invariance of Mi, the sequence {£J lk k _i} k is bounded away from Mo. Since 770 < a>, Lemma 4.2 implies 
that there exists 6 > such that _i}fc ^ iVll F H +d (0). The remainder of the proof is as for part (ii), with d'k-i 
playing the role of t;J' k . I 

Since boundedness of p-basic classes follows from Lemma 4.2, Lemma 4.3 implies that given a p-chain recurrent 
point x, if [x] p C Mi then [x] p is compact. Clearly, if x is p-chain recurrent and [x] p is closed then x is ap-chain 

imsart-generic ver. 2011/11/15 file: QSD_arxiv_nov2013.tex date: November 22, 2012 



/QSD for random perturbations 



17 



recurrent and [x] p C [x] ap but the converse is not true in general. For example consider a situation where the ap 
(5-chains joining x to itself have arbitrarily large length as 5 goes to zero, in which case we could have x ~ ap x but 
x x. While Remark 3.7 holds for p-chain recurrence, it is not immediate and therefore we provide a short proof. 
In the sequel, we will call <5-r/io-pseudoorbit any family £oj •••)£« such that A n (£) < 5. 

Lemma 4.4 Let [x] p be a closed p-basic class in M\. For any 8 > 0, there exists S > such that any S-p-pseudoorbit 
joining [x} p to itself is contained in N ([x] p ). 

Proof. Pick 8 small enough so that N e ([x] p ) C M\. Since F is Mi-invariant and N s ([x] p ) is compact and contained 
in Mi , so is its image by F. Hence, by closedness of Mo, there exists a compact set A C Mi, which contains the 
7-neighborhood of F(N e ([x] p )), for some 7 > 0. Assume, by contradiction, that there exist a decreasing sequence 
6k I and 4-p-pseudoorbits (£0 > • • • > £n k ) (i-e. A„ k < S k ) such that lim^oo = u € [x] p , limfc^oo = w e [x] p 
and jk = min{j > 1 : £j£ N (\x] p )} < n^. For fc large enough, we have 

p{z,y) < S k => d(F(z),y) < 7 

for all z,y S M. Since S iV e ([a;]p), we have £^ S if for k large enough. By passing to a subsequence if necessary, 

£j k converges to some point v € K \ N e {[x\ p ). On the other hand, consider the pseudoorbits £ fc = (£q, . . . ,£* _xj w )- 
They satisfy 

lim < lim A„ fc (£ fc ) + Hm - P(&-i>#JI = 

A) — ^00 k — y 00 K — ^00 

due to A njs (£ ) < Sk and by uniform continuity of p on A x A. Hence, x < p v. Similarly, one can show that v < p x. 
Consequently, v E [x] p , a contradiction. ■ 

Lemma 4.5 The p-basic classes [x] p closed in Mi are invariant: F([x] p ) = [x] p . 

Proof. Since F(x) > ap x, F([x] p ) C [x] p follows if we prove that F(x) < p x. Since [x] p is compact and contained in 
Mi (see Lemma 4.2), we can find a compact set A bounded away from Mo, Sk 4- and a family of <5fc-/9-pseudoorbits 
( k = (fo. J in K such that £0 = Z.n k = a:- Consider the family £ fc = (F(x), $ , . . . , ^* fc _ 1 , x). By uniform 

continuity of /? on A" x A' and the fact that p(x,y) = if and only if y = F(x), we may assume by passing to a 
subsequence if necessary that limfc_ i . 00 = F(x). Hence 

lim A„ fc _i(| fc ) < lim A„ fc (f ) + IMA(x),e 2 fe ) - ,#)| = 

as A nfc (£ fc ) < <5fe and using uniform continuity of p on K x A'. Hence, F{x) < p x. 

For the inclusion [x] ap C A([x] ap ), pick y £ [x] p such that y ^= F(y) (if there is no such y, there is nothing to 
prove). For Sk I 0, choose a family of <5fc-p-pseudoorbits £ fc = (£q, . . . ) in A such that = = Passing 
to a subsequence if necessary, we can assume that Um^oo x = z G A'. Clearly, A(z) = y and z ^ p y. Hence, 
[s] p C F([x] p ). U 

The following proposition is a straightforward consequence of Proposition 5.1 in Kifer [1988]. 
Proposition 4.6 Let [x] p be an isolated p-quasiattr actor in M±. Then it is an attractor and [x] p = n^o-Dr/? where 

D v = {yeM: B p {x,y) < V }. 

Define the maximum distance on M" +1 by d„ ((, £) = maxj=o,..., n d(Q,£j) for (Co, ■ • ■ , Cn), (Co, • ■ • , £«) € M n+1 . 
The following theorem and lemma are analogous to the statements of Theorem 5.2 a) and Lemma 5.3 in [Kifer, 1988, 
pgs 66 and 72, respectively]. We provide a proof of Theorem 4.7 that slightly differs from the proof of Kifer. The 
proof of Lemma 4.8 follows directly from Kifcr's proof of his Lemma 5.3. 
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Theorem 4.7 Let K C M\ be a compact set. Given rj,5,N > 0, there exists eo > such that 

P x [d n ((X^ . . .,XZ),£) < rj\ > exp (-MQ± 5 

for any x G K , e < eq, n < N and £ = (£o, • • • , £n) G if ™ +1 £o = 
Proof. Analogous to Kifer's proof, we introduce the quantity 



,K 



sup{\p(y,z) -p(y',z')\ : y, y' € K, d(y, y') < 7, d(z, z') < 7} 



By uniform continuity of p on compact subsets of Mi x Mi, lim 7 _>o ri^ = 0. Let 77, <5, and N be given. Choose 
< 7 < 77 such that < 8/2N and N^(K) C M x . Now let £ = (£ = £,£1, •••,£«) G By the uniform lower 

bound (4) of Hypothesis 2.5 there exists a function g :]0, +oo[— >]0, +oo[ such that lim^o <?( e ) = and 

elog;> e (x,iV r (&)) > - inf p(x,y) - g(s) 

for any x € N"'(K) and 1 < £ < n. 
Hence we have 



[d n (X £ ,£) <?7 ] > V x [d n (X e ^)< 1 ] 

p e (x, dxi) 

ajiGJVT(6) 



p e (x n _i,dx n ) 



a;„eiVT(C„) 



> p £ (^,A r7 (a))nr i 1 inf p £ (x,,iVTfe + i)) 

Xi£N~>{£i) 



> exp 

> exp 



n-l 



£ \yeJVT(£!) ^ x i SAT-'(£i) 'y' 6ArT (^+i) , 



--(yl„(£)+ng(£) + nnf) 



The result follows by choosing £0 small enough so that g(e) < S/2N, for every e < £o- M 

Lemma 4.8 Let K be a compact set in M which does not contain any entire semiorbits {F l (x),i G N}. Then there 
exists a > and N G N ( which depend on K) such that 

a) For any sequence £ G K n with n > N, we have A n (£) > (n — N)a, 

b) there exists £ > such that, for any n > N and any < £ < £q, 

sup P x [r|r > n] < e * , 

where t e k = inf { j > : X? <£ K} . 

Recall that uj(x) = n„>iU m >„F m (x) and that a point x G M is called non-wandering if for all open neighborhoods 
Uofx and any iVeN, there exists n> N such that F n (U) flU ^ 0. We denote by 2VW(F) the set of non- wandering 
points of F. Note that w-limit points are always non- wandering: {y G M : y G cj(»), for some x} C iVW(F). 

Lemma 4.9 The set NW{F) n Mi «s contained in 1Z P . In particular, any u> -limit point in M\ is also in 1Z P . 
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Proof. Let x G Mi fl NW(F) and 6 > be given. By continuity of p in Mi x M and p(x, F(x)) = for all x, there 
exists 7 > such that 

p(x,F{y)) < 5/2 and p(z,x) < 5/2 
for y G A 7 (x) and A(z) G iV 7 (x). Since x is non- wandering, there exists n > 1 such that 

F"(iV 7 (x)) n A 7 (x) ^ 0. 

Pick y, z £ A 7 (x) such that F"(y) = z. Now consider the chain £ = (x, -F(y), . . . , F n ~ 1 (y), x). Since F{F n ^ 1 {y)) = 
z G -ZV 7 (x), we have A(£) = p(x,F(y)) + p{F n ~ x (y) , x) < (5. Taking 5 1 yields claimed. ■ 

Corollary 4.10 Assume that p is an F '-invariant probability measure whose support S lies in Mi. XTien S" C TZ p . 

Proof. By the Poincare recurrence theorem, S is included in the set 

{x G Mi : x G w(x)} C iVW(F). 

Applying Lemma 4.9 completes the result. ■ 

Corollary 4.11 Assume that 1Z P n Mi admits a neighborhood U , whose closure lies in some compact set A C Mi. 
Then there exists N G N such that any partial solution £ = (x, F{x), . . . , F n (x)) G K n+1 with n > N must pass 
through U . 

Proof. The set K \ U does not contain any entire semiorbit of F, by Lemma 4.9. Since A n (Q = 0, applying Lemma 
4.8a completes the proof. ■ 

We already stated that if [x] p is a closed p-basic class then [x] p C [x] ap . Under the finiteness assumption of 
Theorem 2.7, we derive the equality between ap and p-basic classes. Call Ki, K v the p-basic classes in Mi (recall 
that they are supposed to be closed) and label K\, . . . , Kf the quasi-attractors among them. Proposition 4.6 implies 
that K\,...,K# are attractors. The following lemma implies that finiteness of the p-basic classes in Mi implies 
finiteness of the ap-basic classes in Mi. In particular, Hypothesis 2.1 holds under the assumptions of Theorem 2.7. 

Theorem 4.12 Assume that there is a finite number of p-basic classes in M\. Then 1Z P fl Mi = lZ ap H Mi and 
[x] p = [x] ap for any x G 1Z ap l~l M x . 

Proof. Let x G lZ ap H M\. We prove that x G 1Z P fl Mi and [x] ap C [x] p . If [x] ap = {x}, then x is a fixed point and 
there is nothing to left to prove. Let y G [x] ap , y ^ x and a > 0. 

Remark 3.7 implies there exist a compact set K C Mi with UiKi C K, a sequence 5k \.k and a family 
£ fc = (Co = x > ■ ■ ■ j Cn fc — 2/) e A'™ fc+1 of ap <5fc-pseudoorbits joining x to y. 

Let 7 > be chosen so that B p (a, b) < a for alH = 1, . . . , v and a, b G N 7 (Ki), the closure of C7 = UiA r7 (A'i) is 
contained in A and A 7 (A^) is an isolating neighborhood for Ki for all i. Corollary 4.11 implies that there exists a 
positive integer N such that every partial solution {a, F(a), . . . , A n (a)} in A of length n > N must pass through {/. 
Consequently, by compactness of A^ and continuity of F, we can find ko such that £ fc can not have more than N 
consecutive terms in A \ U for k > ko . 

Now, given fe G N, define cro(fc) = and ro(fc) = min{j > : £j £ U}. For i > 1, define inductively the terms 
(Ji{k) = min{j > Tj_i(fc) : £/f G U} and Tj(fc) = minjj > ai{k) : £| ^ C7}. This defines two sequences {T i (fc)} i= o,...,p fc 
and {<Ti(fc)}i=o,...,q fc ■ Notice that = Pk if V ^ UjAi and % = pk + 1 otherwise. By truncating multiple entries of ap 
pseudoorbits into each set A 7 (Ai), we can assume that qk < v — 1. After truncation, these pseudoorbits may only 
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satisfy d(F($),$ +1 ) < 6 k for n {k) -l<j< a i+1 {k). Therefore 
Pk 

B P (x, y )<j^[B p (e ai{k) ,e m -i)+m^-^^ +B p^^^ 

in the case where y G UiA^, and 

Pk 

B P (x,y) < (Bp(C t{ k)^r m -i) + B P (g i{k) _ 1 ,& i{k) j) 

i=0 

Pk— 1 

+ (-®p(^Ti(fe)'^ j+ i(fe)-i) + 5 p(^ +1 (fc)-u^ i+1 (fc))) + B P (^ Pk ( k ),y) 

4=0 

otherwise. In cither case, our choice of 7 implies 

-B P (a;, y) < va + (v(N + 2) + 1) sup{p(a, 6) : d(F(a),b) < S k , a,b G A'} 

for fc sufficiently large. Uniform continuity of p on A x A implies that limfc^oo sup{ / o(a, b) : d(F(a), b) < 5 k , a, b G 
A} = and we obtain that B p (x,y) < va. Since this holds for any a > we get that x < p y. Similarly, y < p x, 
which yields x ~ p y. Therefore, x G 1Z P and [x] p = [x] ap . ■ 

The next proposition shows that Hypothesis 2.2 holds. 

Proposition 4.13 Let j G {£ + 1, . . . , v}. We can find 77 > such that, for any 7 > 0, there exists Eq > (which 
depends on rj and j) and a function £ on (0,£o) such that lim^o C( e ) = and 

sup F x [T e Nn{Kj) >e^ e ] <C(e) 

for any e < Eq. 

Proof. First of all, by definition of a non p-quasiattractor, there exists rj > such that the closure of N 2r] (Kj) 
belongs to Mi and, for any 7 > and any x G N v (Kj), there exists a sequence £ 7 = (£q, . . . , ^(-y)) sucn that 

^ = *> C( 7 ) t n2v ( K j) and A»(7)(D < 7- 

Call £/ = N 2r, (Kj). Since Mi is invariant by F, F(U) is compact and contained in Mi. Hence there exists r > and 
a compact set A C Mi such that 

N r {F(U)) G A. 

By continuity of p on ?7 x M and since p is strictly positive on the compact set U x (N r (F(U))) c , there exists 70 > 
such that 

p(x, y) > 70 for all x G J7, y G A" °. 

In particular, this means that, for 7 < 70, the sequence £ 7 must pass through A \ U and we can therefore assume 
without loss of generality that £1, ~. G K\U and £ 7 lives in A. 
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Pick 5 > 0. We now apply Theorem 4.7 in the compact set K, with 8, r\ and N = 71(7) : there exists £0 > (which 
depends on 77, 8 and n("f)) such that, for any e < eq, 



? x [d n(7) (X s ,C)<v] >cxp(- 



■y + 8 



Consequently there exists e > such that, for any < e < s (up to changing slightly 71(7)), we have 



r Nv(K j ) 



< 71(7) 



> e 



-7/e 



Consequently, 



T Ni(K J ) ^ e 



2 7 /e 



< 



H _g-7/£)[e 2T/ V™(7)]^ 



The last quantity is of order exp (— e 7 / 6 /(2n(7))) and therefore goes to zero. ■ 

By assumption of Theorem 2.7, there is at least one p-quasiattractor (which turns out to be an attractor by 
Proposition 4.6) among the p-basic classes included in Mi, such that /^ n (U) > for all n, for all open neighborhoods 
U. Hence, by Lemma 3.9 there exists 8q > such that A„ > 1 — (3s { £ n) > 1 — e~ c "/ £n , where 



co = - inf {p(x, y) : (x, y) & Mi x M, d{F(x), y) > 8 } > 0. 

Let 81 < 80 and {^i}i=i,...,u be chosen so that Corollary 3.8 holds. 

We are now ready to prove Theorem 2.7. Since Assumptions 2.1 and 2.3 are satisfied, it remains to verify Hy- 
pothesis 2.2. Choose the neighborhoods {V5}i=^+i ,« such that Vi C N v (Ki), where r\ is given by Proposition 4.13. 
Choose 7 < y where 

< ci = inf {p(x, y) : {x, y) £ Mi x M, d{F(x), y) > 81} < 2c . 

Define h(e) = e 1 ^ . Proposition 4.13 implies that there exists £0 > and a function f such that lim^o C( £ ) = 
and 

sup ^x[Tpr, (K .) > h(e n )} < C(e«) ->■ 
for any e < eq. Since limn^oo h(e n )!3s 1 (e„) = 0, Hypothesis 2.2 holds. This completes the proof of Theorem 2.7. 



5. Applications 

Our results are broadly applicable to many Markov chain models in population biology. To give some flavor of this 
applicability, we introduce two classes of Markov chains satisfying our probabilistic assumptions and some illustrative 
applications to metapopulation dynamics, competing species, host-parasitoid interactions, and evolutionary games. 
For each application there are two ingredients for verifying the conditions of Theorem 2.7. The probabilistic ingredient 
involves verifying that there exist quasi-stationary distributions and verifying the large deviation assumptions. We 
defer verifying these conditions until Section 6. The dynamical ingredient involves verifying there is a finite number of 
p-basic classes and identifying the attractors. For the second ingredient, we introduce a proposition that is applicable 
to most of our examples. 
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5.1. A dynamical proposition 

To state the proposition, we need a few definitions from dynamical systems. For x £ M, let oj(x) = {y : there exists 
iik oo such that lim/j : _ > . 00 F nk (x) = y} be the u -limit set for x and a(x) = {y : there exist — > oo and yk £ M 
such that F nk (yk) = x and lim^oo y^ = y} be the a-limit set for x. Our assumption that F is bounded implies that 
there exists a global attractor given by the compact, ^-invariant set A = D n >oF n (M). For all x £ A, ui(x) and a{x) 
are compact, non-empty, F-invariant sets. 

A Morse decomposition of the dynamics of F is a collection of F- invariant, compact sets K\, . . . , Kk such that 

• Ki is isolated i.e. there exists a neighborhood of Ki such that it is the maximal F-invariant set in the neigh- 
borhood, and 

• for every x £ A \ Uifj, there exist i > j such that cj(x) C Ki and a(x) C Kj. 

Modulo replacing the invariant sets Ki by points, one can think of F being gradient-like as all orbits move from 
lower indexed invariant sets to higher indexed invariant sets. 

Proposition 5.1 Assume Hypothesis 2.5 holds. If F admits a Morse decomposition K\, . . . , Kk such that 

• Ki C Mi or Ki C Mo for each i, and 

• Ki is transitive whenever Ki C M\ i.e. there exists x £ Ki such that {a;, F(x), F 2 (x), . . . } is dense in Ki, 

then p-basic classes in M\ are given by the Ki C M\. In particular, there is a finite number of p-basic classes in M\ 
and each of them is closed. 

Proof. Let K\, . . . , Kk be a Morse decomposition for F. Let I C {1, . . . , k} be such that Ki C Mi if and only i € I. 
By assumption, Ki C Mq for i £ I, and Ki is transitive for i £ I. Transitivity of Ki for i E I and continuity of p 
restricted to Mi x M implies that Ki is contained in a p-basic class for i G I i.e. x ~ p y for all x, y € Ki. As shown 
in Section 4, assertion (iii) of Hypothesis 2.5 implies that the p-basic classes are contained in the ap-chain recurrent 
set of F which is contained in lij-fQ. Hence, the p-basic classes in Mi are given by {Ki}i^j. ■ 

5.2. Nonlinear Poisson branching processes 

To describe structured populations with k types of individuals (e.g. different genotypes or species, individuals living 
in different spatial locations) , let x represents the vector of population densities which lies in the non- negative cone 
of R k . A widely used class of models in population biology [Caswell, 2001] are nonlinear matrix models of the 
form F(x) = A(x)x where A{x) are non-negative matrices representing transitions births, deaths, and transitions 
between types of individuals (e.g. due to mutation or dispersal). 

Since real populations involve finite numbers of individuals, these deterministic models can be viewed as ap- 
proximations of more realistic, stochastic representations of the population dynamics. More specifically, let 1/e that 
represents the "size" (e.g. area, volume) of the habitat. Let Nf e Z* denote the vector of population abundances 
where is the non-negative cone of the fe-dimensional integer lattice. Then Xf = eN^ £ eZ+ is the vector of 
population densities. For every x £ , let Z\(x), Z^ix), ... be a sequence of i.i.d. random vectors with independent 
components, and whose i component has a Poisson distribution with mean Xi. Given Nq £ ZiL , we can define the 
Markov chains {A t e } iteratively by 



t+i 



Z t+1 {A(XI)Nl) and X* +1 = eNl 



t+v 



Equivalently, we can write Af_ 



eZ t+l {F(Xt)/e). 
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A useful observation about these Poisson processes, from the modelling standpoint, is that multinomial sampling 
of a Poisson process still corresponds to a Poisson process. More specifically, consider a multinomial random vector 
(Xi, . . . , Xk) where the number of samples N is Poisson distributed with mean A > and the sampling probabilities 
are (pi, . . . ,p k ). Then 

F[X 1 =x 1 ,...,X k =x k ] = ¥[X 1 =x 1 ,...,X k =x k \N = x 1 + --- + x k ]P[N = x 1 +--- + x k ] 

— Pi ...p k k exp{-\)-— 



Xl \...x k \ ' "'{x 1 + ...x k )\ 

= n^pexp(- Pi A) 

i Xi . 

Hence, X\, . . . , X k are independent Poisson random variables with rate parameters p\\, ■ ■ ■ ,PfcA. We make repeated 
use of this observation in the examples provided below. 

Since F is bounded, one can show quite generally that these Markov chains support quasi-stationary distributions 
whenever there are absorbing sets. For all of our examples, these absorbing sets are {0} or dMfi ={i£ BJl_ : Y[ x% = 0}. 
A proof of this assertion is given in Proposition 6.1 of Section 6. Under slightly stronger assumptions (namely A 
is continuous and J 1 , is strictly positive), we show in Proposition 6.3 of Section 6 that these Poisson processes also 
satisfy our large deviation Hypotheses 2.5 and 2.6. 

To provide a taste of the possible applications, we apply our results to three particular classes of nonlinear Poisson 
branching processes. 



Metapopulation dynamics 

A fundamental question in population biology is how do local demographic processes such as reproduction and 
survivorship interact with dispersal (a regional demographic process) to determine spatial-temporal patterns of 
abundance [Earn et al., 2000, Hastings and Botsford, 2006, Earn and Levin, 2006, Schrciber, 2010]. This issue has 
been studied extensively with discrete-time deterministic models representing space as a finite collection of patches 
connected by dispersal. To illustrate how our results apply to these metapopulation models, we introduce a stochastic 
version of the spatial Ricker map, which was originally studied by [Hastings, 1993] for 2 patches, and for which we 
allow an arbitrary number, k, of patches. 

Let 1/e > be the area or volume of a single patch, N^' 1 denote the number of individuals in patch i, = 
(Nt' 1 , . . . , Nf' k ) the vector of population abundances across space, and Xf = eNf the vector of population densities. 
To describe reproduction within a patch, let f{x) = /oexp(— x) be the mean fecundity of an individual when the 
local population density is x and the "intrinsic" fitness is /o > 0. The map x <— > xf(x) is known as the Ricker 
map in theoretical ecology and is commonly used to describe the population dynamics of a single species [Ricker, 
1954, Hastings, 1997, Wysham and Hastings, 2008]. Let D = (dij) be an irreducible, row-stochastic matrix where dij 
corresponds to the probability of an individual dispersing from patch i to patch j. Given Nf, we define the spatial 
Ricker process as follows 

• Each individual in patch i independently produces a Poisson-distributed number of offspring with mean f(X^' 1 ) 
to replace themselves. Let Z\^ x be the total number of offspring produced in patch i, which is Poisson distributed 

with mean N^' 1 f(X^' % ). We assume that the Z\f Xl . . . , Z\+ x are independent; there are no correlations in the 
reproductive output between distinct patches. 

• Independent of one another, offspring in patch i move to patch j with probability dij. To represent this 
movement, let W^ +1 (i) — (Wt+\($), • ■ • , Wt+i(i)) ^ e a multinomial random vector with sampling probabilities 
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dn, ■ ■ ■ , 
• Define 



dik and Zff t trials. 




J2Wt +1 (i) and XI +1 =eNl 



By our earlier observation about multinomial sampling of a Poisson random variable, {X 4 e } is a nonlinear Poisson 
process with 



Since \\F\\ < fo\\D\\ and D is irreducible, Proposition 6.1 implies the process {Xf} has a quasi-stationary distribution 
/x £ with respect to the absorbing state Mo = {0} for all e > 0. 

Let fx be a weak* limit point of [i e as e — > 0. To say something about the support of /z, we need to understand 
the dynamics of the map F(x). The simplest applicable result is a persistence and extinction dichotomy. Since 
Fi(x) < foXi, it follows that is a global attractor for F(x) whenever fo < 1. Alternatively, when fo > 1, a result of 
Kon et al. [2004, Theorem 3] implies that F(x) has a positive attractor. Lemma 3.9 b') and Theorem 3.12 imply the 
following result. 

Proposition 5.2 Let ft be a weak* limit point for quasi- stationary distributions [i e of the spatial Richer process 
{Xf}. Then 

Extinction if fo < 1, then /x({0}) = 1. 

Metastability if fo > 1, then there exists a 5 > such that [i(N s ({0})) = 0. 

In the limiting cases where the population is either weakly-mixed or well-mixed, we can say more about the support 
of the limiting measure [i. These stronger assertions rely on the one-dimensional map x i— > fox exp(-x) having a 
linearly stable periodic orbit, call it S = {p, F(p), . . . , F n ~ 1 (p)} where n is the period. Kozlovski [2003, Theorem C] 
proved that, for an open and dense set of fo values, such a stable periodic orbit exists. Hence, this assumption is not 
very restrictive. 

Theorem 5.3 Assume the one- dimensional map x i— > /oxexp(— x) has a linearly stable periodic orbit, call it S = 
{p, F(p), . . . , F n (p)y, and D is an irreducible, non-negative matrix whose row sums equal one (i.e. a row stochastic 
matrix). 

weakly mixing // D is sufficiently close to the identity matrix, then there exists nk linearly stable periodic orbits 

for F{x) and fi is supported by the union of these stable periodic orbits. 
strongly mixing // all the entries of D are sufficiently close to 1/k and the column sums of D equal one (i.e. D 

is doubly stochastic), then there exists a unique globally stable periodic orbit for F(x) and the support of fi is 

given by this periodic orbit. 

We remark that in the special case of a single patch, k — 1, we recover results of Hognas [1997], Klcbaner et al. 
[1998], and Ramanan and Zcitouni [1999] for one-dimensional maps on a compact interval. See Section 3.5 for further 
discussion about this point. 

Proof. To prove the first assertion, consider the uncoupled map 



Each of the components of this limiting map are given by the one-dimension map g(xi) = Xif(xi) which by assumption 
has a linearly stable periodic orbit S = {p,g(p), . . . ,5 ri ~ 1 (p)}. This linearly stable periodic orbit gives rise to n k 
periodic orbits of the form (g ni (p), . . . , g nk (p)) with < nj < n for F. Since g has a negative Schwartzian derivative 




F(x) = (xif(x 1 ),x 2 f(x 2 ), ■ ■ .,x k f(x k )). 
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and a single critical point, van Strien [1981, Theorem A] proved that the complement of the basin of attraction 
of S for g can be decomposed into a finite number of compact, g- invariant sets which have a dense orbit and are 
hyperbolic repellers: there exists C > and A > 1 such that \(g n )'(x)\ > CX n for all points x in the set and n > 1. 
Consequently, the fc-dimensional mapping F is an Axiom A endomorphism [Przytycki, 1976, pg. 271]: the derivative 
of F is non-singular for all points in the non-wandering set £l(F) = {x G : for every neighborhood U of x, 
F n (U) P\U 7^ for some n}, £l(F) is a hyperbolic set, and the periodic points are dense in £l(F). Results of Przytycki 
[1976, 3.11-3.14 and 3.17] imply that key attributes of Axiom A endomorphism are (i) Q(F) decomposes in a finite 
number of invariant sets . . . , f2 m (F), (ii) for each orbit {x n } C il l (F) of F, the unstable manifold at xo 

intersects £l l (F) in a dense set, and (iii) maps F sufficiently C 1 close to F are Axiom A endomorphisms. Property 
(iii) implies that F(x) = DF(x) is an Axiom A endomorphism provided that D is sufficiently close to the ident ity 
matrix. Property (ii) implies that each of the invariant sets SY(F) is a p-equivalence class. Linear stability of the n k 
periodic orbits (g ni (p), . . . , g nk (p)) with < rij < n for F implies that, for sufficiently small perturbations F of F, n 
of the invariant sets Q l (F) correspond to linearly stable periodic points, while the remaining invariant sets are either 
either hyperbolic repellers or saddles. Since the stable periodic orbits are the only p-quasi-attractors, Theorem 2.7 
implies the first assertion of the proof. 

We prove the second assertion. Since D is doubly stochastic, the non-negative half-line C = {x : x\ = ■ ■ ■ = Xk > 0} 
is F-invariant i.e. F(xil) = g(xi)l where 1 is the vector of ones. As in the case of the proof of the first assertion, the 
dynamics of F of restricted C has the stable periodic point (pl,g(p)l, g n ~ 1 (p)l) and the complement of its basin 
of attraction can be decomposed into a finite number, say to, of compact, (/-invariant sets which have a dense orbit 
and are hyperbolic repellers. Define D by dij = dij — dik for all By choosing dij sufficiently close to 1/fc for all 
we can make the matrix D as close to zero as we want. Hence, Earn and Levin [2006, Theorem 1] implies that 
£ is a global attractor for the dynamics of F. Moreover, the stable periodic orbit for F restricted to C is stable for 
F. Proposition 5.1 implies that each of these invariant sets is a p-equivalence class. Since the stable periodic orbit is 
the only p-quasi-attractor, Theorem 2.7 implies the second assertion. 

Competing species 

During the mid twentieth century, laboratory experiments played a key role in establishing the competitive exclusion 
principle in ecology. One classic set of competition experiments was conducted by Park [1948, 1954] with flour beetles. 
To model the dynamics of these competing beetles, collaborators of Park [Leslie and Gower, 1958] used difference 
equations, rather than the classical Lotka-Volterra differential equation model of competition. Cushing et al. [2004] 
showed that these difference equations exhibit the same dynamical outcomes as the Lotka-Volterra models. Namely, 
one or both species may go extinct for all initial conditions, may coexist about a globally stable equilibrium, or 
may exhibit contingent exclusion where the initially "more abundant" species excludes the other species. Here, we 
consider a stochastic counterpart of the Leslie-Gower model. 

Let A^ t e = (N^' 1 , N^ 2 ) and = sNf denote the abundances and densities of the competing species at time t. 
Once again, 1/e corresponds to the volume of their habitat. The pcr-capita mean fecundity fi{x) for species i is 
given by fiix) = 1+c ..^ i +c . — where j ^ i, bi > is the "intrinsic" birth rate, Cu > is the strength of intraspecific 
competition, and Cy > is the strength of interspecific competition. If individual births are independent given the 
current density of individuals and Poisson distributed with means fi(X^) i = 1,2, then N^ 1 is a nonlinear Poisson 
process associated with the map F(x) = {fi{x)x\, f2{x)x2)- Proposition 6.1 implies the Leslie-Gower process has 
quasi-stationary distributions [i e for e > with M = ={iel 2 : Xi > 0} and M = cM+ = {i£ : X1X2 = 0}. 
Results of Cushing et al. [2004, Theorem 4], our Theorem 3.12, and our Theorem 2.7 imply the following result. 

Theorem 5.4 Let fx s be a quasi- stationary distribution for the Leslie-Gower process {Xf}. Let /j a weak* limit point 
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of these quasi- stationary distributions. 

Coexistence Ifbi > 1 for i = 1,2 and Ci(bj — 1) < 6, — 1 for £ = 1,2 and i ^ j , then fi is a Dirac measure supported 
by the point 

f h-l / _ 6i - 1 \ 6i - 1 / _ 6 2 -l 
\cic 2 - 1 V 62 - 1/ ' cic 2 - 1 V &i — 1 

Extinction or Exclusion Ifbi<l for some i, or bi > 1 /or i = 1, 2, 62 — 1 > (bi — l)/ci and 61 — 1 > (62 — l)/c 2 , 
or 6^ > 1 for i = 1,2, 62 — 1 < (&i — l)/ c i an( ^ 61 — 1 < (62 — l)/ c 2j iften /i is supported by cM 2 ^. 

The case for which our results are not conclusive is when the dynamics of the Leslie- Gower model are bistable 
(i.e. bi > 1 for i = 1,2 and Ci(bj — 1) > 6, — 1 for i = 1,2 and i 7^ j) in which case there is a positive unstable 
equilibrium and all initial conditions not lying on its stable manifold (which has dimension one) go to However, 
we conjecture that fi is supported on the boundary of the positive quadrant in this case. 

Host-parasitoid interactions 

Predator-prey interactions involve one species benefiting by harming another species. These interactions are the 
fundamental building block for all food webs. An important class of predators are parasitoids such as wasps or flies 
whose young develop in and ultimately kill their host [Godfray, 1994]. Mathematical models of these interactions 
have been studied for almost a century [Thompson, 1924, Nicholson and Bailey, 1935, Hassell, 1978, 2000, May, 1995, 
Schreiber, 2006a, 2007, Gidea et al., 2011]. As predator-prey interactions are inherently oscillatory, these studies often 
focused on identifying mechanisms that stabilize predator-prey interactions. 

Here, we introduce a stochastic analog of these deterministic models. Let Hf and P t E denote the abundances of 
host and the parasitoid in generation t, respectively. Let A t E = eNf = e(i/ e ,P t £ ) be their densities where 1/e is the 
size of the environment. Let /(A^' 1 ) be the mean number of offspring produced by an individual host. Let g{Xf) 
be the probability that an offspring escapes parasitism from the parasitoids. We update the population state Af 
according to the following rules: 

• Each adult host independently produces a Poisson distributed number of offspring with mean /(A t e,:L ). Let 
M t+ i be the total number of offspring which is Poisson distributed with mean Hff(Xl'). 

• Each offspring survives parasitism independently with probability <7(A t £ ). Let H^ +1 equal the number of surviv- 
ing offspring and P t e +1 = Mt-\-i ~ Ht+i ^ e the number of parasitized offspring which all emerge as parasitoids 
in the next generation. 

Since (Hf +1 , P t e +1 ) is binomial distributed with M t +\ trials, (iJ e +1 , P t e +1 ) are independent Poisson random variables. 
Hence, A e is a nonlinear Poisson process associated with the map 

F ( x ) = (f(xi)xig(x)J(x 1 )x 1 (l - g(x))). 

on = {{x\,X2) G R 2 : Xi > 0}. Provided that F is continuous and / is a compact map, Proposition 6.1 implies 
that there is a quasi-stationary distribution [i e for A t e with e > 0. 

To understand the support of the weak* limit points fj, of /i e , we focus on a generalized Thompson model [Thompson, 
1924, Getz and Mills, 1996, Schreiber, 2006a, 2007]. For this model, /(an) = exp(r(l -Xi/K)) is given by the Ricker 
equation where r > is the intrinsic rate of growth of the host and K > is the host's carrying capacity. The escape 
function g(x) = (1 + x 2 / {bx\k))~ k corresponds to a negative binomial escape function with egg-limited encounter 
rates. Here, 6 > is the attack rate of the parasitoid and 1/fc > represents how "clumped" or "aggregated" par- 
asitoid attack are, i.e. smaller k correspond to greater aggregation of parasitoid attacks. Notice that while g is not 
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defined at x± = 0, the map F extends continuously to x\ = if we set F(x) = whenever x\ = 0. Combining results 
from Schreiber [2007, Theorem 3.1,3.2] and Theorem 2.7 yields the following results for k < 1 i.e. parasitoid attacks 
are sufficiently aggregated [Hassell et al., 1991]. 

Theorem 5.5 Let (i be a weak* limit point of the quasi- stationary distributions for the Thompson host- parasitoid 
process Xf. Assume k < 1 and define 

y* = max{y > : exp(-r) ((1 + y/(bk)) k - l) = y}. 

Then 

Extinction 7/exp(r)(l + y*/(bk))~ k < 1, then ft is supported by the <9R+ = {i£ Rl : X\X% = 0}. 
Coexistence 7/exp(r)(l + y* / (bk))~ k > 1, then /i is supported by R+ \ dM.^. Moreover, for an open and dense set 
of parameter values (r,b) satisfying exp(r)(l + y* / (bk))~ k > 1, a is supported by a periodic orbit. 

When k > 1 (i.e. parasitoid attacks are not sufficiently aggregated), Schreiber [2007, Theorem 3.1] implies coexistence 
does not occur for the deterministic model. However, this extinction often involves unstable sets in the interior of 
R+. Consequently, our results are not applicable. Nonetheless, we conjecture that ji is supported by &M? + . 



5.3. Multinomial processes 

Consider a landscape with N sites that can be in one of k states. These states may correspond to occupation by 
individuals playing different strategies in the context of evolutionary games, or different genotypes in the context of 
population genetics. Let M — {x € R fe : Xi > 0, X)i=i x i = 1} be the fc-simplex and F : M — » M be a continuous 
map. 

For each x G M , let Z\(x), ^(x), ... be a sequence of independent random vectors with a multinomial distribution 
with N trials, k possible outcomes, and probability Xi of producing type i in a single trial. If e — 1/N and Xq € 
M n e1 k is given, then we can define a Markov chain {Xf }£2. on M n eZ fe iteratively by 

Xl +1 =eZ t+1 (F{X;)) 

Since {X t £ } is a finite-state Markov chain, quasi-stationary distributions exist uniquely whenever the transition 
matrix restricted to the transient states is aperiodic and irreducible [Darroch and Seneta, 1965]. When Mq is the 
boundary of the simplex and Fi(x) — Xifi(x), with fi continuous and positive, we therefore always have a unique 
quasi-stationary distribution, and we prove in Proposition 6.5 of Section 6 that the large deviation hypotheses are 
satisfied. As a particular application of these multinomial processes, we consider evolutionary games. 



Evolutionary game dynamics 

Evolutionary game theory studies the dynamics of populations of players, each programmed to play a fixed strat- 
egy throughout their life time [Hofbauer and Sigmund, 1998, 2003, Cressman, 2003]. These populations often ex- 
hibit frequency dependent selection; the reproductive success of a player changes in time due to the composition 
of strategies in the population. The study of evolutionary games has led to fundamental insights into the evolu- 
tion of animal conflicts [Maynard Smith, 1974], cooperation [Nowak ct al., 2004, Imhof and Nowak, 2010], habitat 
selection [Schreiber et al., 2000, Cressman et al., 2004], and mating systems [Sincrvo and Lively, 1996]. 

A basic deterministic model for evolutionary games is the discrete-time replicator equation [Hofbauer and Sigmund, 
2003] 



Fi{x) 
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where xi is the frequency of strategy i in the population, the entries Oy of the "pay-off" matrix A describe the fitness 
gain to strategy i when interacting with strategy j, and c is the "basal" fitness of an individual. The dynamics of this 
discrete system have been studied extensively [Hofbauer and Sigmund, 1998, 2003]. Here, we describe a stochastic 
analog of these games that account for finite population sizes. In the case of two-strategies, this stochastic analog 
corresponds to the frequency dependent Wright-Fisher processes studied by Imhof and Nowak [2006]. 

Let Nf' 1 and X\ ,% denote the abundance and frequency of the i-th strategy at time t. Here, 1/e is assumed to 
be an integer that corresponds to the total population size which does not change over time. If we update 7V t e by 
taking a multinomial random variable with 1/e trials and probabilities F{X^), then we get a multinomial process. 
One can interpret this process as individuals producing many offspring proportional to their fitness J^ . a%jXj +c, and 
randomly selecting individuals from the offspring "pool" to replace their parents. For this stochastic process {X^}, 
M = {x 6 M : JJ x,i = 0} is an absorbing set that corresponds to the loss of one or more strategies. 

We can leverage two results from the theory of replicator dynamics to describe the support of the quasi-stationary 
distributions /it E when e > is sufficiently small and the basal payoff c is sufficiently large. To first order in 1/c, 



Fi(x) ks Xi + x^ i^Z 



a ij X j ^""^ QjlXjXl 
jl 



and the dynamics of the map x F{x) can be viewed as a Cauchy-Euler approximation to the classical continuous 
time replicator equations 



dxi 

~dt ~ Xi 



j jl 



Using this observation and work of Hofbauer and Sigmund [1998], Garay and Hofbauer [2003], we can prove a suf- 
ficient condition for the stochastic replicator processes exhibiting metastable persistence for e > small and c > 
large. 

Theorem 5.6 Let \i be a weak* limit point of the quasi- stationary distributions fi e for the replicator process with 
c > sufficiently large. If there exists pi > such that 



Eft 



jl 



at every equilibrium x £ Mq for F , then fx is supported by Mi . 

Proof. Hofbauer and Sigmund [1998, Theorem 13.6.1] implies that the continuous-time replicator equations (11) 
have a global attractor A C Mi whenever there positive weights Pi,...,Pk such that 

» \ j ji j 

at every equilibrium x £ Mq for F. In particular, in the terminology of Garay and Hofbauer [2003, Definition 2.1], (11) 
admits a good average Lyapunov function whenever these positive weights exist. Consequently, Garay and Hofbauer 
[2003, Theorem 8.3] implies that the discrete-time replicator equation F admits a global attractor A C Mi whenever 
c is sufficiently large. This global attractor, however, need not be a unique p-quasi attractor. However, as suggested 
by Comment 3.8, we can apply Lemma 3.9b' to conclude that /i(Vb) = for some neighborhood Vo of M . ■ 
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As an interesting special case consider the rock-paper-scissor game where the payoff-matrix is of the form 

/ -oa b 3 \ 
A = 61 -as 
\-ai 62 / 

with dj and 6^ positive. Zecman [1980] proved that if det(-A) > 0, then the persistence condition of Theorem 5.6 is 
satisfied. Moreover, for the continuous-time replicator equations, there is a globally stable internal equilibrium. For 
c > sufficiently large, this equilibrium is also globally stable for the map F and, consequently (i is a Dirac measure 
supported by this equilibrium. When det(^4) < 0, Zeeman proved that the internal equilibrium is unstable and all 
other orbits of the continuous-time deterministic system approach the boundary and one can show that the same 
conclusion holds for the discrete-time system when c > is sufficiently large. Since the boundary is not a global 
attractor in this case, we can not apply Theorem 3.12. None the less, we conjecture that [i is supported on the 
boundary Mq in this case. 

6. Large deviation results for Poisson and multinomial models 

In this section we prove the existence of quasi-stationary distributions (as needed) and verify our large deviation 
hypotheses for the Poisson and multinomial models introduced in Section 5. 

6.1. Nonlinear Poisson branching model 

We first prove the existence of quasi-stationary distributions for the nonlinear Poisson processes introduced in Section 
5.2. 

Proposition 6.1 If sup xe ^ ||_F(a;)|| < 00 and Fi(x) > for all x G M \ Mq and i, then the nonlinear Poisson 
process {^ t e } associated with F has at least one quasi- stationary distribution supported on M \ Mq. 

Proof. For notational convenience, we prove this result for the Markov chain {N£} and call p £ (x,y) its transition 
kernel. Let X = Th_ \ Mq and q £ (x,y) denote the restriction of p £ to X. Let ^(X) denote absolutely summablc 
functions from X to R. For u £ l x (X), define ||u||i = ^ x |w(x)|. 

We can define a linear operator Q £ from V~(X) — > l x (X) by (uQ £ )(x) — J2 y ex ^(Vi x ) u (v)- Recall that /i e is a 
QSD for the Markov chain N £ if and only if it is a non negative eigenvector of the operator Q £ . Since Fi(x) > for 
all i and x G M \ Mq, q £ (x 1 y) > for all x, y G X. 

Next, we show Q £ is a compact operator i.e. the image of the unit ball under Q £ is precompact. Recall that a 
closed set C in ^(X) is compact if and only if it is bounded and equisummable: for all S > 0, there exists such 
that sup„ eC X^||x||>jv \ u ( x )\ ^ Defining G(x) = F(ex)/e, we get 

? £ (x,y) = n^T^exp(-G«(x))<n^, where m = sup ||G(x)|| 
»=i Vl - 1=1 Vi - x£X 
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Hence, for u with ||u||i < 1, 



\\uQ £ h 



X 



< 



x y i=X 



k 




< 



En 



x i—1 



Moreover, given 5 > 0, choose N > such that X)||a;||>jv Ili=i mXi /xi ] - < $■ Then J2\ x \>n \( u Q e )( x )\ — $ f° r an u 
such that ||u||i < 1. Hence, Q e is a compact operator. 

On the other hand, given that q £ is strictly positive on X x X, we have for any Y C X, 



Applying the following result of Jentzsch on Kernel positive operators completes the proof of this proposition. 

Theorem 6.2 [Schaefer, 1974, Theorem V.6.6.] Let E = L p (/j,), where 1 < p < +oo and (X,'E,fi) is a a-finite 
measure space. Suppose Q £ £-{E) is an operator given by a (E x E) -measurable kernel q > 0, satisfying the following 
two assumptions: 

(i) Some power of Q is compact, 
(ii) Y c E ; n{Y) > and £t(E \ Y) > implies 



Then the spectral radius r(Q) is positive, is a simple eigenvalue, its unique renormalized eigenvector v satisfies 
v(x) > 0, fi almost surely and r(Q) is the only eigenvalue of Q with a positive eigenvector. Moreover, if q{x,y) > 
{fi <8 /i) almost surely then every other eigenvalue of Q has modulus strictly smaller than r{Q). ■ 

The following proposition verifies the large deviation hypotheses of Section 2 for the nonlinear Poisson processes. 

Proposition 6.3 Assume that x H> F{x) is continuous, Fi(x) > for all i and x G M \ Mq, and sup^ggt ||-F(x)|| < 
oo. Then the nonlinear Poisson process {A" t e } associated with F satisfies Hypotheses 2.5 and 2.6. 

Proof. Let denote the distribution of the Poisson random vector Xf +1 conditional to Xf = x. The logarithmic 
moment generating function relative to [i x e is given by 



Hence, the family sA e . x (-/e) is identically equal on R fe to the function A-^A) = J2i=i Fi(x)(e Xi — 1). Thus, by the 
Gartner-Ellis Theorem (see, e.g., [Dembo and Zeitouni, 1993, Theorem 2.3.6]), the family ^ satisfies a large deviation 



]r£g%^)>0 



x^YyGY 




X\Y JY 




k 
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and for any open set G C 

liminfe log /4(G) > - inf A* x (y). 

e-s-0 y£G 

Hence, if we define 

p{x,y) = A* x (y) 

then p immediately satisfies (i), (ii) and the upper bound of (iv) of Hypothesis 2.5. 

Now let us derive the uniform lower bound of Hypothesis 2.5. Pick a compact set K C M\ and an open ball 
B C M. Let B £ = B n (eN fc ) be the e-lattice on B. Let a > 0. For every x £ K, there exists y(x) 6 B such that 
p(x,y(x)) < inf^gs p(x,y) + a. Choose Eq > small enough such that 

d (y, V 1 ) < £o => \p(x, y) - p(x,y')\ <a 

for all y,y' E B and x £ K. For each x £ K and each < e < £o, we can choose a point y s (x) = e{n\, . . . , n|) such 
that d(y £ (x),y(x)) < e and, consequently, p(x,y £ (x)) < infj, 6 s y) + 2a. For all < e < Eo and all x S -ftT, we 
have 

> £ ({«.(«)}) = Ht.e-^W/- ^^" 1 . 

Recall that, for any p G N, — p — \ogp\ + plogp > — (1 + \ogp) and define I + = {i £ {l,...,fc} : n\ > 0}. A 
straightforward computation gives 

Fi{x) 



e\o gl 4{B) > -^2Fi(x)+J2 enfIog-5p-elog(nf!) 



p(x,y e (x)) + £n t (- 1 ~ ^log«0 +lognf j 



> —2a — inf p(x, y) — e > (f + log i 



yeB 

The last quantity goes to zero as s goes to zero, independently of x since n\ is of order (y e (x))i/e and the quantities 
(y e (x))i are bounded. Hence, we have shown that the lower bound (4) holds uniformly for any compact set K C M\ 
and open ball B. 

To verify that p satisfies (iii) of Hypothesis 2.5, we first prove the following lemma: 
Lemma 6.4 Define g : (0, oo) x [0, oo) — > [0, oo) by g(x, y) = y log ^ + x — y. Then for all 5 > and m > 0, 

inf {g(x, y) : \x — y\ > 8, x < m} > a > 

Proof. Let m > and 8 > 0. We have that C :— {(x,y) : |x — y\ > 8, x < m} = A U B with A = {(x,y) : 
m > x > 8, < y < x — 8} and B = {(x, y) : y > 8, < x < y — 8}. Since A is compact and g restricted to 
A is positive and continuous, ixd( x ^^Ag{x, y) > 0. Restricted to B, g(x,y) is positive and increasing in y. Hence, 
inf(i,j) £ B g(x, y) = ml Q<x < m g(x, x + 8). Since 

-^-g(x, x + S) = log(l + 8/x) -8/x<0 
ax 

we get mf 0< x<m g(x, x + 8) = g(m, m + 8) > 0. Thus, in£( x>y ) eC g(x, y) > 0. ■ 
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Let now 8 > be given, d(x, y) = max^ \n — yi\, and g be as denned in the lemma. If \yi — Fi(x)\ > 8, then Lemma 
6.4, with m = sup^t ||F(x)||, implies that p(x,y) = Y,j d{F j {x),y j ) > g{F. l {x),y l ) > a. 

To check that the uniform upper bound (5) holds, notice that it is sufficient to prove that quantities p x ^[Fiix) + 
8, +oo [) (where fJ-^(-) is the i-marginal of [i*, namely the distribution of the % component of X^ +1 , conditional to 
= x) are bounded above by some expression which goes to zero as e goes to zero, uniformly in x £ K . This is an 
easy consequence of Chernov upper bound: 

< 4 ([^(x) +*,+«>[) < e-^WiW-H) < e"^, 

where /3 = inf {g{x, y) : |x — y\ > 8, x < m} > and the quantity on the right hand side goes to zero uniformly in 
x by {Hi). 

Finally to verify Hypothesis 2.6, notice that p £ (x, 0) = exp(— ^ Fj(x)/e) and recall that F(0) = 0. Hence, given 
c > 0, choose a neighborhood Vq of {0} such that Fi(x) < c whenever x £ Vb- Then elogp £ (x,0) > — c whenever 

a; e v . m 



6.2. Multinomial model 

Here we verify the large deviation assumptions for the the multinomial processes introduced in Section 5.3. 

Proposition 6.5 Assume Fi(x) = Xifi{x), with fi continuous and positive. Then the multinomial process {X%} 
associated with F satisfies Hypotheses 2.5 and 2.6 with respect to the absorbing set M = {x £ M : LJ x.; = 0}. 

Proof. Let [i x N be the law of the multinomial random vector jjZi(F(x)), which can be written as -k Y^,iLi Yi(F{x)), 
where (Yi(F(x))i is an i.i.d sequence with distribution F[Yi(F(x)) = ej) = Fj(x) (e_, is the unitary vector in R fc 
whose j-th component equals one). By Cramer's Theorem (see, e.g., [Dembo and Zeitouni, 1993, Theorem 2.2.30]), 
the sequence pt x N satisfies a large deviation principle with convex rate function A* (y) = J2i=i Vi 1°§ p V (x) ' Hence, if 
we define p(x,y) = A*(y), then p immediately satisfies (i), (ii), (iv) of Hypothesis 2.5. 

The proof of the uniform lower bound is similar to the Poisson branching process case. Let K C Mi be a compact 
set and B C M an open ball. Let B N = B fl ^-N fe . Let a > be given. For every x £ K, there exists y(x) £ B such 
that p(x,y(x)) < inf ye s p{x,y) + a. Choose Nq > 1 sufficiently large such that 

d(y, y') <1/N ^ \p{x, y) - p{x,y')\ < a 

for all x £ K and y, y' £ B. For each x £ K and N > Nq, we choose yjv(x) = jf(n^, . . . , nf) such that 
d(y N (x), y(x)) < l/N. Let 1+ = {i : nf > 0}. For N large enough, 

^-logp x N (B) > ^-logfi x N ({y N {x)}) 



N °^ JVV ' ~ N 



> -p(x, y N {x)) + ( 1 + i log AH - log Jv) + 1 nf (-1 - ^ log nf ! + logn^ 



at ° In 



> -2a- inf p(x,y)-i- ^(1 + lognf) 



yes rv AT . 
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The last quantity goes to zero as TV — > oo. independently of x since nf is of order N(yjy(x))i and the quantities 
(yN(x))i are bounded. Hence, we have shown that the lower bound (4) holds uniformly for any compact set K C M\ 
and open ball B. 

To verify that p satisfies (iii) of Hypothesis 2.4, assume by contradiction that there exist /3 > and two sequences 
(x n ) n and (jjn)n in the fc-simplex M, converging respectively to x and y, and such that 

limp(x„,y„) = and d{x n ,y n ) > /3. 

n 

Define Iq = {i G {1, . . . , k} : x l — 0}. Notice that, if i G Iq, then lim„ y\ log |f = if y l — and y^ log ~ — > +oo 

otherwise. As a consequence, y l = Vi G io- We consider two cases separately. 

Assume first that all the components of a; are zero except the first one: Iq = {2, . . . , k}. Then y % = for i = 2, fc, 
which implies that a; = y, a contradiction. 

Assume now that Jo contains at most k — 2 terms. Call /i its complementary and assume without loss of generality 
that ii = {l,...,m}. Define x = {x 1 , ...,x ni }, y = {y 1 , y ni } and notice that x G Int(A ni ) and y G A™ 1 , where 
A™ 1 is the ni-simplex. Define analogously the sequences (x n ) n and (y n )n, which belong to the set {u G R™ 1 : u l > 
u 8 < 1}. Let now p be the application given by 

(u, v) G R™ 1 x R" 1 ^ v * lo S — ■ 

This map is continuous and strictly positive in (x, y), since \x — y\ > (3. Therefore, there exists 8 > and r > such 
that p > 5 on Br«i (J, r) x i?R"i (y, r). This concludes the proof since 

P~{x n , Vn) + 2/ri log -M > /? + lim V log -f = f3. 

tela n J ielo " 

The uniform upper bound (5) holds by an application of Chernov upper bound. Let 8 > 0. Then 

where f3 = inf {y log f , (x, y) G (0, l] 2 , \x - y\ > 8} > 0. 

Finally to verify Hypothesis 2.6, we have p £ (x, Mo) > max^ (1 - Fi{x)y/ £ . Hence, elog p £ (x,M ) > maxilog(l - 
Xifi(x)). Given c > 0, choose a neighborhood Vq of Mo such that min^ < 1 — e _c whenever x G Vq- Then 
elogp e (a;, 0) > — c whenever a; G Vq. B 
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